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Foreword 


Number theory is a fascinating branch of mathematics. There are many examples of 
number theoretical problems that are easy to understand but difficult to solve, often 
enough only by use of advanced methods from other mathematical disciplines. This 
might be one of the reasons why number theory is considered to be such an attractive 
field with many connections to other areas of mathematical research. 

In August 2012 the number theory group of the Department of Mathematics at 
Wurzburg University, under the aegis of Jorn Steuding, organized an international 
summer school entitled Four Faces of Number Theory. In the frame of this event 
about fifty participants, mostly PhD students from all over the world, but also a few lo¬ 
cal participants, even undergraduate students, learned in four courses about different 
aspects of modern number theory. These courses highlight a strong interplay between 
number theory and other fields like combinatorics, functional analysis and graph the¬ 
ory. They will be of interest to (under)graduate students aiming to discover various 
aspects of number theory and their relationship with other areas of mathematics. 

Kathrin Bringmann from Cologne gave an introduction to the theory of modu¬ 
lar forms and, in particular, so-called Mock theta-functions, a topic which had been 
untouched for decades but has obtained much attention during the last five years. 

Yann Bugeaud from Strasbourg lectured about expansions of algebraic numbers. 
Despite some recent progress, presented in his essay, questions like ‘does the digit 7 
occur infinitely often in the decimal expansion of square root of two?’ remain very 
far from being answered. Here combinatorics on words and transcendence theory 
are combined to derive new information on the sequence of decimals of algebraic 
numbers and on their continued fraction expansions. 

Titus Hilberdink from Reading lectured about a recent and rather unexpected ap¬ 
proach to extreme values of the Riemann zeta-function by use of (multiplicative) 
Toeplitz matrices and functional analysis. 

Finally, Jurgen Sander from Hildesheim gave an introduction to algebraic graph 
theory and the impact of number theoretical methods on fundamental questions about 
the spectra of graphs and the analogue of the Riemann hypothesis. 

In this volume the reader can find the course notes from this summer school and 
in some places further additional material. Each of these courses is essentially self- 
contained (although a background in number theory and analysis might be useful). 
In all four courses recent research results are included indicating how easily one can 
approach frontiers of current research in number theory by elementary and basic an¬ 
alytic methods. 

The picture on the front page shows the poster created by Nicola Oswald from 
Wurzburg University for the summer school. The editing of the course notes had been 



done by Rasa Steuding from Wurzburg University. The authors are most grateful to 
both of them for their help. Last but not least, we sincerely thank Jom Steuding from 
Wurzburg University for initiating the summer school and the publication of these 
notes as well as for his editorial work. 


The authors, March 2014 
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Asymptotic formulas for modular forms and 
related functions 


Kathrin Bringmann 
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1 Introduction 

In this paper 1 , we aim to describe how “modularity” can be useful for studying the 
asymptotic behavior of arithmetically interesting functions. We do not attempt to 
present all that is known, but rather work with examples to give an idea about basic 
concepts. 

Let us recall the objects of interest. In the words of Barry Mazur, 

“Modular forms are functions on the complex plane that are inordinately symmetric. 
They satisfy so many symmetries that their mere existence seem like accidents. But 

they do exist.” 

The modular forms alluded to in this quote are meromorphic functions on the complex 
upper half-plane H := {r e C;Im(r) > 0} that satisfy (if / is modular of weight 
ke ZforSL 2 (Z)) 

f = (CT + d)t fM 

v(" ^eSL 2 (Z):=j^ eMat 2 (Z);arf-6c = lj. (1.1) 

Moreover, as a technical growth condition, one requires the function to be “meromor¬ 
phic at the cusps”. Note that the transformation law can be generalized to subgroups, 
to include multipliers or half-integral weight. We do not state the specific form of 


1 The research of the author was supported by the Alfried Krupp Prize for Young University Teachers of the 
Krupp Foundation. 










2 


Kathrin Bringmann 


these transformations here, but we will later treat special cases, for example in Theo¬ 
rem 3.1. 

An important property of modular forms is that they have Fourier expansions of 
the form /(r) = a(n)e 2jIulT . This follows from the transformation law (1.1), 

the fact that ( q }) e SL 2 (Z), and the meromorphicity on EL Meromorphicity at the 
cusps now says that a(n ) f 0 for only finitely many n < 0. The coefficients a(n) of¬ 
ten encode interesting arithmetic information, such as the number of representations 
of n by a (positive definite) quadratic form, just to give one of the numerous examples. 

Modular forms play an important role in many areas like physics, representation 
theory, the theory of elliptic curves (in particular the proof of Fermat’s Last Theo¬ 
rem), quadratic forms, and partitions, just to mention a few. Establishing modularity 
is of importance because it provides powerful machineries which can be employed 
to prove important results. For example, identities may be reduced to a finite cal¬ 
culation of Fourier coefficients (Sturm’s Theorem), asymptotic formulas for Fourier 
coefficients can be obtained by Tauberian Theorems or the Circle Method, and con¬ 
gruences may be proven by employing Serre’s theory of p -adic modular forms. In 
this note we are particularly interested in asymptotic and exact formulas for Fourier 
coefficients of various modular ^-series. In Section 2 we consider holomorphic mod¬ 
ular forms, then in Section 3 allow growth in the cusps, in Section 4 turn to mock 
modular forms, and finally treat mixed mock modular forms in Section 5. 


2 Classical modular forms 

In this section we restrict, for simplicity, to forms of even integral weight k for 
SL 2 (Z). For basic facts on modular forms and most of the details skipped in this 
section, we refer the reader to [31]. 

We call a modular form a holomorphic modular form if it is holomorphic on the 
upper half-plane and bounded at 00 . The space of holomorphic modular forms of 
weight k is denoted by If a holomorphic modular form exponentially decays 
towards 00 it is called a cusp form. The associated space is denoted by S Spe¬ 
cial holomorphic modular forms that are not cusp forms are given by the classical 
Eisenstein series and they have very simple explicit Fourier coefficients. 

Definition. Formally define for k e N the Eisenstein series 

Gfe(r) := ^2 0nr + n)~ k , 

m,n€ Z 


where the sum runs through all ( m.n ) e Z 2 \ {(0,0)}. We note that 


G 2 £+l( T ) = 0. 


Theorem 2.1. For k > 4 even, we have that Gk e Mk- 
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Proof, (sketch) 

Step 1: Prove compact convergence. 

Step 2: Apply modular transformations and reorder. □ 


Remark. We also require a normalized version of the Eisenstein series. For this, set 
Too := {(o i);« e Z} and for M = ( a c b d ) € SL 2 (R ),k e Z, and / : H -> C we 
define the Petersson slash operator by 

f\ k M{x) : = (cr + d)~ k f ( ^ + ■ 

\cx + a ) 

Define for k > 4 an even integer 

Ek(r):=\ 

MeT oo\SL 2 (Z) 


where the sum runs through a complete set of representatives of right cosets of Too in 
SL 2 (Z). We have that 


Ek(r) = 


1 

2 m 


G k ( r), 


where £(s) := (Refs) > 1) denotes the Riemann zeta function. Indeed, it 

is not hard to see that a set of representatives of Poo \ SL 2 (Z) may be given by 


j(* 2)«SL ! B;( t ,J) = i}. 

Thus 

E k(r) = \ J2 ( cr + d r k = J2 r ~ k J2 {cx + d) ~ k 

(c,d )=1 ^ ’ r> 1 (c,d)= 1 

= lk) T. (crz + dr)-*. 

' ( c,d)=l 
re N 


Now (cr, dr) runs through Z 2 \ {(0,0)} if r runs through N and (c, d) through Z 2 
under the restriction that (c.d) = 1. This gives the claim. 


We next turn to computing the Fourier expansion of the Eisenstein series. 
Theorem 2.2. We have the Fourier expansion for k > 4 even 

„2jzinz 


2k 


Z./Y % > 

Ek( r) = 1 - — }o k -i(n)e 
Bk 

where cxi(n) := Yl d \ n anc 1 A the kth Bernoulli number, given by the generating 

function 

x „ x " 

7 = 


— 


n> 0 
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Proof. We instead compute the Fourier expansion of G & and then use that for k even 


« t) = <_,)*+. 

Due to the absolute convergence of the Eisenstein series we may reorder 

Gic(z) = ^ (nit + n)~ k = ^ 7J _fe + ^ ^(mr + n) _A 

m^OneZ 


m,ne Z 


= 2£(k) + 2 ^2 E( ,71T + n ) 


-k 


( 1 . 2 ) 


m >0 neZ 


Now it is well-known by the Lipschitz summation formula (cf. pp. 65-72 of [30]) 
that 

, (-27 xi) k 


E< T + «) k = 


(k-1)! 




— \glninx 


neZ v tz>1 

This gives that the second term in (1.2) equals 


2(2ni) k 
(k — \ )l 


ET,‘i 

m> 1 d> 1 


k—l^2nimdr _ 2(27TZ) 


_ d k ~ X 

(k - 1)! ^ ^ 


— \g27timx 


m> 1 dim 


which gives the claim. □ 

Examples. We have the following special cases: 

E 4 ( r) = 1 + 240 ^ c>i(n)e 27linr , 

n> 1 

£ 6 (t) = 1 - 504 Y^°5{n)e 27linx , 

n> 1 

£ 8 (t) = 1 + 480^a 7 (n)e 2jri '" T . 

n> 1 


We next turn to bounding Fourier coefficients of holomorphic modular forms. For 
this we split the space AT/, into an Eisenstein series part and a cuspidal part. 

Theorem 2.3. Assume that k > 4 is even. Then 

Mk = C Ek © Sk- 

Proof. Assume f(x) — ^ n>0 a(n)e 2llinx e Mk- Then 

f -a(0)E k e S k . □ 
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Since the coefficients of the Eisenstein series part were given explicitly in Theo¬ 
rem 2.2, we next turn to bounding coefficients of cups forms. 

Theorem 2.4 (Hecke bound). For /(r) = a(ri)e 2ninx e S & with k > 0, we 

have 

a(n) — O (n 2 ^ . 

Proof. It is not hard to see that the function 

/(t) := Im(r)2[/(r)[ 

is bounded on H^Indeed, one can show that / is invariant under SL 2 CZ) and one 
may then bound / for sufficiently large imaginary part. Using Cauchy’s Theorem, 
we can write for n > 1 

a{n) — e 2ltny f f(x + iy)e~ 2ninx dx , 

Jo 

where y > 0 can be chosen arbitrary. This yields that 

r. 1 _ 

|a(n)| < y~ie 2 ” ny / f(x + iy)dx < cy~^e 27tny 

Jo 

for some constant c > 0 (independent of y). Picking y = - gives 

| a(n)\ < cn-e 271 = O . □ 

Remark. The Ramanujan-Petersson Conjecture predicts that for / e 4 we have for 
any e > 0 the (optimal) estimate 


a(n) <£. e ,f n 2 + ■ 

For integral weight k > 2 this conjecture was proven by Deligne using highly ad¬ 
vanced techniques from algebraic geometry. His method begins with the fact that the 
vector space of cusp forms has a basis of common eigenfunctions of the so-called 
Hecke algebra and that the Fourier coefficients of these forms coincide with the 
eigenvalues. 

Corollary 2.5. Assume k > A is even and /(r) = a(n)e 27Ilnt e M^. Then 

2k / k\ 

a(n) = —a(0)—crjfc-i(n) + O (« 2 ). 


Proof. The claim follows directly by combining Theorem 2.3 and Theorem 2.4. □ 
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Remark. We have that 

n r < a r {n) — ^ d r — n r ^ d~ r < n r 
d\n d\n 

In particular, we have for k > 4 that for / e 

a(n) = 0 (n k ~ 1 ^ 

and if f $ S^, this bound cannot be improved. 

We next turn to defining explicit cusp forms whose construction generalizes that 
of Eisenstein series. 

Definition. For k e N and n e No, formally define the Poincare series 

PkA r) := Y, *** inX \ k M • 

M er 00 \SL 2 (Z) 


Yd~ r = mn r . 

d> 1 


Note that 

P kt0 (x) = 2E k (x). 

A calculation similar to the proof of Theorem 2.1 establishes the modularity of these 
functions. 

Theorem 2.6. For k > 3 the Poincare series Pi, :jl converges uniformly on every 
vertical strip in EL In particular, Pk, n *= A-ffc. and for n > 0 we have Pk, n e S k - 

Poincare series turn out to be useful, as they have explicitly computable Fourier 
expansions and integrating cusp forms against them recovers the Fourier coefficients 
of these forms. 

To precisely state this result, we need the Petersson inner product. To define this, 
note that for f. g e M k 

dxdy 
dp := — 

V 2 

f(.r)W)y k 

are invariant under SF 2 (Z). 

Definition. A subset T C H is called a fundamental domain of SF 2 (Z) if the fol¬ 
lowing hold: 

(i) IF is closed. 

(ii) For every tei there exists M e SF 2 (Z) such that Mr e F. 

(iii) If r and Mr ( M e SF 2 (Z)) are in the interior of F, then M = ±(J J). 

We also require the following explicit fundamental domain. 
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Theorem 2.7. The following is a fundamental domain of SL 2 (Z): 



AT 

Remark. Note that if r = x + iy e T 7 . then y > 


Definition. For / £ and g e Sk, formally define the Petersson inner product by 



Later we also require a regularized version of this inner product. To introduce it, 
denote for T > 0 



Following [7], we define the regularized inner product (f g) reg of forms /, g which 
transform like modular forms of weight k, but may grow at the cusps. To be more 
precise, we let (/, g) reg be the constant term in the Laurent expansion at s = 0 of the 
meromorphic continuation in s of 



if it exists. Note that if / e Mk and g has vanishing constant term, then we have 



(1.4) 


The following theorem may be easily verified. 

Theorem 2.8. The integral (1.3) is absolutely convergent and the following condi¬ 
tions hold: 

(i) ifg) = {g,f)l 

(ii) (fg) is C- linear in f; 

(iii) (f f) > 0 and moreover ( f f)= 0 44- / = 0. 

In particular, integrating against Poincare series yields the Fourier coefficients of 
cusp forms. 

Theorem 2.9 (Petersson coefficient formula). For f{x) = J2m>i a ( m ^e 27timx 6 S k , 


we have 



( ( 4 ( Lp-> a(n > for n > 1. 


for n — 0, 
for n > 1. 


In particular, E^A-Sk with respect to the Petersson scalar product. 
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Proof. We only argue formally and ignore questions of convergence. We have 
[f Pk, n ) = f /0) e 2n:inr \i c My k dfi 

Jjr Meroo\SL 2 (Z) 


£ L 


f\,M(x)e 27linx \kMy K dp. 


Mer 00 \SL 2 (Z) 


-l 


U Me roo\SL 2 (Z) M J- 


f(x)e 2ninx y k dp. 


(1.5) 


Note that UmsTcxAsl^CZ) ^ I s a fundamental domain for the action of Poo on H 
and that f(x)e 2lxinx v k is invariant under the action of P^. Thus we may change 

u 


Meroo\r 


M F into 


{r — x + iy:0 < x < 1 ,y > 0} . 
This gives that (1.5) equals 


p OO pi 

/ / f(x)e- 2nin *y k - 2 dxdy 

Jo Jo 

poo p 1 

= V a(m) / e -Z*(n+m)y y k-2 dy / e 2nUm-n)x dx 

Jo Jo 


Now the claim follows by using that 


f 


e 27tUx dx = 


1 if P = 0, 

0 otherwise, 


pOO 

/ e- l t k - 2 dt = (k-2)\. 
Jo 


□ 


To show that the Poincare series constitute a basis of Sk, we first require the 
Valence formula (see [31] for a proof). 

Theorem 2.10. If f ^ 0 is a meromorphic modular form of weight k e Z, we have 

2tc i 

with p e 


ord(/;oo) + ^ord(/;i) + ^ord(/;p) + ord(/;z) = -^, 

zer\H 

z^i,p (modr) 

where ord(/; oo) = n o if f(x) = J2T=n 0 a ( n )q n with a (no) # 0. 

One can conclude from Theorem 2.10 the following dimension formulas, the 
proof of which is omitted here. 
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Corollary2.11. Fork e No, 


dim M 2 k 



dim Sk 



if k 

# 1 

(mod 

6), 

if k 

= 1 

(mod 

6), 

if k 

#1 

(mod 

6), 

if k 

= 1 

(mod 

6),k > 7, 

if k 

= 1. 




This easily gives the basis property of the Poincare series. 

Corollary 2.12 (Completeness theorem). Let k > 4 even and dk := dim(.Sy). Then 
a basis of Sk is given by 

{Pk,n- tl = 1 , . . . , dk } . 

In particular, Mk has a basis consisting of Eisenstein series and Poincare series. 

Proof Set S' := span{/yi. Pk,d k } C Sk and let f e Sk be such that /_LS 

with respect to the Petersson inner product. Then / has a Fourier expansion of the 
form /(r) = a(m)e 2jIimx . From Corollary 2.11 we know the precise 

values of dk, yielding a contradiction to Theorem 2.10. To be more precise, we have 
for k ^ 2 (mod 12) 

+ 1 > = ord(/; oo) + ^ ord(/; z) > d k + 1. 

zeryn 

For k = 2 (mod 12), we get 

k , 1 

— — dk T 1 T — . 

12 6 

Thus ord(/; oo) = dk + 1 and 

X! ord (/; z ) = y 

zer\n 


+ 1 — 


k 

12 


which is impossible. □ 

We next compute the Fourier coefficients a n {m ) of the Poincare series Pk, n - F° r 
this, we require some special functions. Define the Kloosterman sums by 

• am-\-an 
c , 

a (mod c)* 


where the sum runs over all a (mod c) that are coprime to c and a denotes the 
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multiplicative inverse of a (mod c). Moreover, we let J r be the / -Bessel function 
of order r, defined by 

, n ._ r (-i)* py +2£ 

ru ' ^l\r(l + 1 + r) V2/ 

l> 0 v ’ 

where F (x) denotes the usual gamma-function. 


Theorem 2.13. We have for n e N 


■«■> = (=) 


k -1 
2 


8m,n + 2ni k E c 1 S (n,m: c) Jk-i 


C> 1 



where 


{ 1 if m = n, 
0 otherwise. 


( 1 . 6 ) 


Proof. We again use that a set of representatives of r oo \SL 2 (E) is given by 
{(* *)eSL 2 (Z);(c,J) = lj. 

The contribution for c = 0 is easily seen to give the first summand in ( 1 .6). For c f 0 
we use the identity 

ax + b a 1 

cx + d c c 2 ^ 

and change d h* d + me, where d runs (mod c)* and m e Z. This gives 

c> 1 d (mode)* 

where a is defined by ad = 1 (mod c) and 

T(x) e c2(r+m) (r + m) . 

me Z 

Now the classical Poisson summation formula yields 

J-(x) = J2 a{m)e 27limx , 

me Z 


where 
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with C > 0 arbitrary. For m < 0 we can deform the path of integration up to infinity 
yielding that a(m) = 0 in this case. For m > 0 we make the substitution r = 
ic~ 1 (n/m)2w to get 

a(m ) = r k - 1 C k ~ 1 f C+l °° w -k: e^yfrnn(w-w~ l ) d w ' 

Jc-io o 

The claim follows using the fact that for ji, k > 0 the functions 
t i—» , (t > 0), 


and 


(Re(u;) > 0), 


are inverses of each other with respect to the usual Laplace transform (8.412.2 of 
[23]). □ 


3 Weakly holomorphic modular forms 

We next turn to weakly holomorphic modular forms, which are still holomorphic 
on H, but admit poles at the “cusps”. The Fourier coefficients of such forms grow 
much faster than those of holomorphic forms. Let us in particular describe this in the 
situation of the partition function. 

Recall that a partition of a positive integer n is a nondecreasing sequence of posi¬ 
tive integers (the parts of the partition) whose sum is n. Let p(n ) denote the number 
of partitions of n. For example, the partitions of 4 are 

4, 3+1, 2 + 2, 2+1 + 1, l + l + l + l, 


so that p(A) — 5. The partition function is very rapidly increasing. For example, 

P( 3) = 3, 

P( 4) = 5, 

/>(10) = 42, 
p( 20) = 627, 
p(100) = 190569292. 


A key observation by Euler is the product identity 


P(q) := 1 +J2p(n)q n 

n> 1 


n 

n> 1 


1 

1 -q n ' 
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One can show that for \q\ < 1, the function P is holomorphic. Using Euler’s identity 
one can embed the partition function into the modular world using the Dedekind 
^-function ( q — e 2ltlx ) 

n{x):=q^\\{\-q n ). 

n> 1 

This function is a modular form of weight 1/2. To be more precise, we have the 
following transformation laws (see e.g. [31]). 

Theorem 3.1. We have 

t)(r + 1) = eTiq(r), 

) = vQrtyOO. 

Theorem 3.1 in particular gives the asymptotic behavior of q as r —> 0. To be 
more precise, it implies that 

P(q) ~ J —iretur (t —> 0). 

We moreover note that the coefficients pin) are easily seen to be positive and mono¬ 
tonic. From this, we may conclude the growth behavior of p(n) using a Tauberian 
Theorem due to Ingham [27]. 

Theorem 3.2. Assume that /(r) := q n ° y/„ >fl a(n)q n is a holomorphic function on 
H, satisfying the following conditions: 

(i) For all n e No, we have 

0 < a(n) < a(n + 1). 

(ii) There exist c e C, d el, and N > 0 such that 

/(r) ~ c(-iT)~ d e 2s t K (r -* 0). 



Then 


a{n) 


-nW-De^'J^n (/7 —> oo). 


From this we immediately conclude the growth behavior of the partition function. 

Theorem 3.3. We have 


p(n) 


1 


An \/3 


, e *H 


(n -> oo). 


(1.7) 


We note that the partition function also has the g-hypergeometric series represen¬ 
tation 


Piq) = £ 


q 


n> 0 


iq\q) 


2 ’ 


( 1 . 8 ) 
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where (a: q) n := ]~[yZ 0 (l — aq '). Showing modularity by just using this represen¬ 
tation is still an open problem [3]. 

Rademacher [36], building on work of Hardy and Ramanujan [22], used the Circle 
Method to obtain an exact formula for p(n). To state his result, we let 


I s (x) ■= i s J s (ix ) = ^2 

m> 0 


i 

m\T(m + a + 1) 


m+a 


(1.9) 


be the I -Bessel function of order s. Moreover, with / 12 (A) := (), we define the 
Kloosterman sum _ 

\/k \ - 2nix 

A k(n ) := 2 ^ Xi2(x)e 12 * . 

i x (mod24/:) 

x 2 = l—24 n (mod24A:) 


Note that for k > 0, (h, k) = 1, and Re(z) > 0 we may rewrite the transformation 
law of the partition generating function as 

P ^exp - (,h + iz)^ = bo h , k sfzeT^( z l ~ z )P ^exp [h' + r - 

' ( 1 . 10 ) 

where hh' = — 1 (mod k). Here 


<*>h,k :=exp(7 ris(h,k)), 


( 1 . 11 ) 


with 


and 


■<».*):= E (( 

li (mod k) 



t)) 


((A)) : 


_( x — [x\ — 2 if x e 


G>h,k ~ 


0 if x e Z. 

Sometimes it is also useful to rewrite u>h,k as [33] 

^ — k ^ ni (2— hk— /i) + y 2 {k—k~^)(2h—h'+h 2 h'^ 

^ e —ni{\(k—\)+Yz(k—k- l> ){2h—h'+h 2 h'y) 

Here (denotes the Jacobi symbol. 

Note that we may also write 

A k(n) = 2 ^ M h,k e 

h (mod A:*) 


if h is odd, 
if k is odd. 


(1.12) 


Theorem 3.4. Forn > 1, we have 
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We note that this is a very astonishing identity expressing the integer p(n) as 
an infinite sum of transcendental numbers. Recently Bruinier and Ono [16] found 
a formula expressing p(n) as a finite sum of algebraic numbers. 


Proof. Here we only give some details of the proof, for more see [4]. By Cauchy’s 
Theorem, 


Pin) = ^~ f 
2m J c 


P(q) 


n +1 


c <7 

where C is any path inside the unit circle surrounding 0 counterclockwise. We may 
choose for C the circle centered at 0 with radius p 
(later we let N —> oo). Then 


= exp(— jjj), with A^ > 0 fixed 


/' 


Define 


Pin) = P~ n P (pexp {2nit)) exp (—2nint) dt. 


dh ' k 1 k(k ] + k ) and &h ’ k ' k (k 2 + k ) ’ 


where f are adjacent Farey fractions in the Farey sequence of order N 

(for example, see p. 72 of [4]). From the theory of Farey fractions it is known that 

O' = 1.2) 

1 ^ 1 

k + kj ~ N + I’ 

We decompose the path of integration in paths along the Farey arcs — fK k < f < 
d' kk , where </> = t — p Setting z = k(N~ 2 — if) then yields 


Pin) = 



0 <h<k 
{h,k)=\ 


2ninh \ 

_ — 1 


r 

J-K 


P ^exp + iz) 


exp (—2jiin(j)) df. 


For N -> oo we have that z -> 0, i.e., exp(=jp-(ft + iz)) -> exp We thus 

need to know the behavior of P(q) as q —> exp To find it, we apply (1.10) 

to obtain 


/2j rn\ ( 2ninh\ 

p{n) = exp I — 1 2^ exp|- — I 

V ' 1 <k<N V ' 

0 <h<k 
(h,k)=\ 

xj * z 2 eX P (y^( z_ ' — Z ^) ^ ^ ex P jj eX P (—2nin(j)) df. 
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Now exp+ |)) —» 0 as z -> 0 + . Thus all the terms in P(q) — 1 are 
‘small”. We therefore write 

PW = Er + E 2 ’ 


with 


E ( 2 Jt Yl \ ^—\ ( 

i :=exp (A^J E ex p( 

^ / 1 7 , AT \ 


2ninh 


l<k<N 

0<h<k 

(h,k)=l 


^h.k 


ftff 

J-“ z ^ exp (lS^ z_1 ~ z )) cx p(- 2 « 


— 2 Tiincf)) dcj). 


E 2 : = ex p 


h,k 

2nn 

A^" 


E ex p(“ 


1 <k<N 
0<h<k 
(h,k)=l 


2 izinh 


O->h,k 


[K, k r 

/ 22 

J-K „ 


n 


exp 


( z ' - z Y 

1 2k 


x Jp ^exp — 1 j exp (—2 nincp) dcj). 

Here we only focus on the main term and only note that contributes to the 
error terms, giving 

£ 2 = o(Af-texp(l^))^0 

for N —> oo. The proof is given in [4]. 

Turning to the main term, setting w := N~ J — i<p yields 

E f 2 tzvi \ ^—x / 2ninh\ 

i = cx p\~/ v ^~) E ex p(-—j 


£ Oh,kh,k > 


\<k<N 

0<h<k 

(h,k)=\ 


with 


Here 


, l _ 2 , 

I h k ■= — ik 2 exp (— 2iznN ) / 

J N- 2 -iK. k 


g(w)dw. 


g(w) := exp ^2 jt ^// - 
Using the Residue Theorem, we can write 


w + 


71 


12 k 2 w 


exp 


( 27in\ , 1 

j Ih,k = —k 2 i (Lk ~ Xj — X 2 — X 3 — X 4 — 1 5 — X 6 ) 
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with 

1 \ n \ 

n -24> ! + l2i^) dZ - 

I, := ^ zi exp (2* (n - 1) z + rfz, 

where X and Ij denote the paths of integration in the picture 




We note without a proof that X 2 , 13 , 14 , and X 5 contribute to the error and vanish 
as N —> 00 . Moreover, as e —> 0 


Xi + X6 




12^ 2 r7 


dt =: —2iC 


This gives that 

X(«) = ^2 A k (n)xj/ k (n), 

k> 1 


where ^(n) is defined as in (1.12) and 


1 (f k {n) := —i sfkC k + 2\ZkC* k . 


* ^ 
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To finish the proof, we have to show that 



Inserting the power series expansion for the exponential function, interchanging sum¬ 
mation and integration, and then making a change of variables gives that 



e z z~ s+ 2dz, 


where the loop L is as in the above picture. 
Using the Hankel loop integral formula 



then yields that 



Treating C* k similarly, the claim follows using the series representation of the Bessel 


function (1.9). 


□ 


Corollary 3.5. The asymptotic estimate (1.7) is true. 


Proof. The claim follows immediately from Theorem 3.4 using that 



(x -> oo). 


□ 


4 Mock modular forms 


Next we aim to study coefficients of mock modular forms, which are holomorphic 
parts of harmonic (weak) Maass forms. These are a generalization of modular forms 
which satisfy a transformation law like (1.1) and (weak) growth conditions at the 
cusps, but, instead of being meromorphic, they are annihilated by the weight-/: hy¬ 
perbolic Laplacian (t = x + iy ) 
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We let Ilk denote the space of harmonic Maass forms of weight k. A theory of har¬ 
monic Maass forms was built by Bruinier and Funke [15]. Previously, Niebur [34, 35] 
and Hejhal [24] constructed certain non-holomorphic Poincare series which turn out 
to be examples of harmonic weak Maass forms. Since functions that are holomorphic 
on H are annihilated by A^., weakly holomorphic modular forms are harmonic Maass 
forms. The simplest (non-weakly holomorphic) harmonic weak Maass form is given 
by the non-holomorphic Eisenstein series of weight 2. To be more precise, define 

E 2 (x):= 1 -24£cr 1 («)g". 

n> 1 


Then E 2 is translation invariant, but introduces an error term under modular inver¬ 
sion: 

Ei ( = t 2 E 2 {t) + —. 

V rj 7X1 

This gives that 

E 2 (t) := E 2 {t) -— 

ixy 

is a harmonic weak Maass form of weight 2. 

A further example of a mock modular form of weight | is given by the generating 
function of Hurwitz class numbers [25]. To be more precise, the function 


Hr) : = 


£ H(n)q" + 

n> 0 

«=0,3 (mod4) 



Q(w) 

(-/ (r + w)) 2 


dw 


(1-13) 


is a harmonic Maass forms of weight | on To(4). Here H(n) is the nth Hur¬ 
witz class number, i.e., the number of equivalence classes of quadratic forms of 
discriminant —n, where each class C is counted with multiplicity l/Aut(C) and 
®(w) := Ylnez e 2nin w is the usual weight-^ theta function. We note that these 
class numbers may also be related to so-called over partitions [9]. 

In this paper, we are in particular interested in those harmonic Maass forms J~ for 
which there exist a polynomial Vjr and £ > 0 such that as y —> oo 

T(r)-V T (q~ l ) = o(e- iy ). 

The function is called the principal part of J~. We denote the associated 

space of Maass forms of weight k by H£. 

Using the differential equation satisfied by harmonic Maass forms, it is not hard 
to see that forms in H£ naturally split into a holomorphic and a non-holomorphic 
part: 

jf(t) = jr+( T ) + jr-(T), 
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where 

J- + (x) := ^ a + {n)q n , 

n^>—oc 

J-(t) : = ^a“(«)T(l — k\ Aji\n\y)q n . 

n< 0 

Here 

pOO 

F(a;x) := / e~ t t 0l ~ 1 dt 
J X 

is the incomplete gamma function. The function J~ + is called a mock modular form 
and its coefficients often encode interesting arithmetic information. Each mock mod¬ 
ular form has a hidden companion, its shadow, which is necessary to fully understand 
the mock modular form. The shadow, which is a classical cusp form of weight 2 — k, 
may be obtained from the associated harmonic weak Maass form of weight k by 

applying the differential operator := 2 iy k ^=. A direct calculation gives 
(-H = -(4n) 1 ~ k J2 a ~(-n)n 1 ~ k q’ 1 . 

n> 1 


A space “orthogonal” to H£ can be charaterized in terms of the holomorphic 
differential operator 


D k ~ 1 : = 


\2jzi dz 


k -1 


Bruinier, Ono, and Rhoades [17] showed the following 


Theorem 4.1. If k £ > 2, then the image of the map 

D k ~ x : H 2 _ k -* Ml 

consists of those h e M' k which are orthogonal to cusp forms with respect to the 
regularized inner product ( 1 .4), and which also have constant term 0. 


Further examples of mock modular forms are given by the so-called mock theta 
functions, a collection of 22 g-series including 


m := £ 

n> 0 




J2 a ( n )q n ’ 

n> 0 


which were defined by the visionary Ramanujan in his last letter to Hardy [37]. Note 
that this function agrees with the representation (1.8) of P(q) as a q -hypergeometric 
function up to a simple change of sign. Ramanujan’s last letter to Hardy includes the 
claim that 


(_!)« + ! 
2 Jn 


nr 

V s 


a {n) 




(as n —> 00 ). 
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As typical for his writing, Ramanujan left no proof of this claim. Dragonette, a Ph.D. 
student of Rademacher, then solved this claim in her thesis [19]. Andrews [1] im¬ 
proved upon her work and together they made the following conjecture. 


Conjecture (Andrews, Dragonette). Forn e N, we have 


a(n) = 


JL \ y 

(24 n - 1)5 k>l 


11 

2 


it -J2\n — 1 
12 n 


Note that not even convergence of this exact formula is obvious. In joint work 
with K. Ono, we proved this conjecture [12]. 


Theorem 4.2. The Andrews-Dragonette Conjecture is true. 


We note that S. Garthwaite [21] showed in her thesis a similar result for the co¬ 
efficients of Ramanujan’s mock theta function co(q) defined in (1.14). Later we [13] 
proved exact formulas for coefficients of a generic harmonic weak Maass form. 

The coefficients ot(n) also have some combinatorial interpretations. To describe 
this, recall that the rank of a partition is its largest part minus the number of its parts. 
Dyson [20] introduced this statistic to explain the famous Ramanujan congruences 

p(5n + 4) = 0 (mod 5), 
p{ln + 5) = 0 (mod 7), 
p(lln + 6) = 0 (mod 11). 

More precisely, Dyson conjectured that the partitions of 5 n + 4 (resp. In + 5) form 
5 (resp. 7) groups of equal size when sorted by their ranks modulo 5 (resp. 7). As an 
example, the following table gives the ranks for the partitions of 4. 


partition 

rank 

rank (mod 5) 

4 

4-1 = 3 

3 

3+ 1 

3-2=1 

1 

2 + 2 

2-2 = 0 

0 

2+1 + 1 

2- 3 = -1 

4 

1 + 1 + 1 + 1 

1 - 4 = -3 

2 


Since a direct calculation confirms that such a splitting in residue classes cannot 
hold modulo 11, Dyson postulated the existence of another statistic, which should 
be called the crank and which should explain all of the Ramanujan congruences. 
Dyson’s rank conjecture was solved by Atkin and Swinnerton-Dyer [6]; the crank was 
found by Andrews and Garvan [5]. To study ranks it is natural to consider a generating 
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function. Denoting by N(m,n ) the number of partitions of n of rank m, it is well- 
known that 


i + ("*’«) = 1 + 


n> 1 me Z 

In particular, letting z = — 1, we obtain 


( zc T- q)n ( z V;?) 


1 + Y J iN e {n)-N 0 (n))q n = 1 + £ 


f2> 1 




= /(?), 


where N e (n) (resp. N 0 (n)) denotes the number of partitions of /? of even (resp. odd) 
rank. This yields the combinatorial interpretation of the coefficients a(n) of the mock 
theta function f(q). 

Proof of Theorem 4.2. The idea in [12] is to realize f(q) as the holomorphic part of 
a Maass-Poincare series. Such Poincare series generate the space of harmonic Maass 
forms and have the form 


E 


(cx + d) k f 


( a -^r). 

\cx + a ) 


where T is an appropriate subgroup of SL 2 (Z) and <p is a translation-invariant func¬ 
tion that satisfies the differential equation of a Maass form. Fourier coefficients of 
such Maass-Poincare series are then easily computed using Poisson summation. 

To be more precise, we need to recall work of Zwegers [39] which completes 
/( q ) to a (vector-valued) harmonic weak Maass form. Watson [38] had previously 
obtained (mock) transformations for f(q) and related functions. To state Zwegers’s 
result we require a further mock theta function 


a)(q) 


£ 

n> 0 


n 2n 2 +2n 

H 

(q-q 2 fn +1 


(1.14) 


to build the vector-valued function 

F{r) := (Fo(T),Fi(r),F 2 (r)) r = (q~^f(q), 2q?a) , 2q^a> • 


We moreover require its non-holomorphic companion, a certain “period integral”, 


/ i cx 


(gi(z),go(z),-g2(:)) 
v''-/(z + r) 


■dz, 
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where the following cuspidal theta functions of weight | are defined as 

go(z) := £(-l)» + 

neZ x 7 


gdz)-=-J2{ n + J) 

nez v 7 

g 2 (z) := J2 (n + 

neZ ^ 7 


\\ 3ni(n+lf z 


, 3 7ti(n + Y) z 


Then define the completion of F by 

F{x) := F{x) - G(r). 

Theorem 4.3. We have the following transformation laws for the generators of 
SL 2 (Z): 

0 0 

0 & |F(t), 



F{x), 


where := e « . Moreover, 

A,(?)=0. 

From Theorem 4.3 it is then not hard to conclude that 
F 0 (x) := F 0 (24r)~Go(24r) 

is a harmonic weak Maass form of weight ^ on To(144) with Nebentypus character 
X 12 in) := (^)- Here r 0 (N) :={(“*) e SL 2 (Z);c = 0 (mod A)}. 

Let us next define the precise Maass-Poincare series. For this, we require further 
notation. For matrices (“ b , ) e Fq( 2) with c > 0, define the multiplier 


a b 
c d 


( t~ b 

_ 1 S24 

— ) i-\(—\}\(c+ad+l) e | a+d _ a _|_ Srfc'j Q) -l 


8 ) ^ d.c 


if c = 0, 
if c > 0, 


24c 4 

where a>h,k was given in (1.11). Flere e(x) e 2jllx . Note that this multiplier is 
defined to agree with the automorphy factor of Fa when restricted to Fo(2). 

A key function for building the Maass-Poincare series is the special function (s e 
C, u e M\{0}) 

Msiu) : = \u\-*Mi sgDWs _i (|u|). 
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Here is the standard M-Whittaker function, which satisfies the differential 

equation 


_a*_ 

3 u 2 


7 ■ 


' 1 v 

— — H-h 

, 4 u 




7 = 0 . 


Then set 


»«=-*.(-?)'(-£)■ 


A direct calculation shows that 

From the function <p s we then build the Poincare series 

2 i 

Pi(s\r) :=— Y x(M)~ l {cr + d)~^(p s {Mr) , 

”” Meroo\r 0 (2) 


(1.15) 


where we always choose representations with non-negative lower-left entry. For 
Re(s) > 1, this series converges absolutely and satisfies for M = ( a K) e Fo(2) 
the correct transformation law 

( ax + b \ i 

■y; cr + c l J = X(M)(cx + d)*Pi (s;x). 

Specializing to s — | then formally yields, by (1.15), a function that is annihilated 
by Ai. Convergence is however not guaranteed. To overcome this problem, we 
analytically continue the function via its Fourier expansion. 


Proposition 4.4. We have 




£ + £ rMr - |24 " «-*. 

n >1 n<0 ' 


Here 


m = 


JC \ > 

<24n-l)i^ 




/ll 


7T 


v/24n - 1' 


12k 


, ,A*+i| . ( i(i+(-t)*)\ 

2k {" 4 j , (tv Vl24// - 1| 

Y(n) = ~ -— 2 ^- Z - 


[24/t -ll 4 ^ 


-J i 

2 


12 k 
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Proof. Using Poisson summation, the proof proceeds similarly to the proof of Theo¬ 
rem 2.13. In particular we require the following integral evaluations (see page 357 of 
[24]): 


[ (\—iu\ - ( AnB \ (2jziBu \ 

/ 1-— M_k i -- -r 1 exp -- - + 2niAu 

J — oo \ 1 + t M / 2 ’ 2 V 1 T W 2 / \ 1 + M 2 / 

2 JT 


= r(2s) 


B s 


r(.s-|) ” -2^-2 

4jr 1+J 


+ M 2 

(47r^U8T) 


+ 2jziAuj du 

if 4 > 0, 


(2i-i)r(i+|)r(j-|)' 

(4^|4|)/ 2 ,_ 1 (4 7 r V ^6T) 


if 4 = 0, 


if H < 0. 


Here W V ^ L is the usual (T'-Whittakcr function. The claim then follows from the spe¬ 
cial values of Whittaker functions (y > 0) 

Wi (y) = e~-, 

4 

W| (-y) = e?T , 

Ms (-y) = ^ 


once convergence is established. Here for s e C and u e M\{0} 
W s {u)^\u\^W hMs _ h {\u\). 


Proving convergence is quite involved and requires modifying a classical argu¬ 
ment of Hooley [26]. To use his method, we rewrite the Kloosterman sums as Salie 
sums, which are sums over quadratic congruences. To be more precise, one can show 
that 


(„ - 

k 


Pi(n\k) 

V2Ak 

ip\{n\k) 2ie (^) p 2 (n\k) 

V2Ak j2Ak 


if k is odd, 
if k is even, 


where 

Pi(n;k) := ^ 

x (mod 48 k ) 

*2 = 1—24 n (mod 48/:) 

p 2 (n\k) := 

* (mod 1 2k ) 
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Using the fact that 


Ii 

2 


n\/2\n — 1 
V2k 


ln^\2\n — 11 \ | 24 « — 1|4 

\ 12 k ) ~ V6 k 


(k -> oo) 


it is not hard to complete the proof once we establish the following estimates ((j, l) e 

{(0,1), (0,2), (1,1)}): 


E 

k> 1 

k=j (mod 2) 


Pi (n\k) 
k 


« 124/t - l|z. 


The proof of this estimate follows similarly as in Hooley. □ 

As the Poincare series was constructed such that /3 (/7) agrees with the formula 
conjectured, we are left to show that 

a(n) = f(n). 


For this we prove that the associated Maass forms coincide. The argument is some¬ 
what lenghty and has been later simplified [13]. □ 


5 Mixed mock modular forms 

We next turn to the Fourier coefficients of mixed mock modular forms, which are 
linear combinations of mock modular forms multiplied by modular forms. We note 
that there is no theory of such functions, the reason being that the space of harmonic 
Maass forms is not closed under multiplication (the eigenfunction property is vio¬ 
lated). In particular, there are no corresponding Poincare series known, and those 
were key ingredients for proving an exact formula for the Fourier coefficients of 
f(q). To overcome these problems, we [10] developed an amplied version of the 
Hardy-Ramanujan Circle Method. Before describing the main modifications neces¬ 
sary, we want to give some examples. 

The first example comes from combinatorics. 

Definition. A partition without sequences [2, 32] is a partition without two consecu¬ 
tive numbers. We denote by s{n) the number of partitions of n without sequences. 

For example the partitions without sequences of 5 are given by 

5, 4+1, 3 + 1 + 1, 1 + 1 + 1 + 1 + 1. 


Thus s (5) = 4. 
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We have the generating function [2] 


G (?) := 

n> 0 


H 3 ^ 3 ) oo 

(q 2 -q 2 ) a o 


•*(?). 


where the product in front is a quotient of ^-functions (and thus modular) and 


m := E 
«>0 


(-q-q)n n 2 

7— 


is one of Ramanujan’s third-order mock theta functions. 

A further example comes from the generating function for Euler numbers of cer¬ 
tain moduli spaces [11]. The functions that are of relevance here are ( / e {0,1}) 

fjG) := -gTT H ^ n + 3 ./)4” + ^' = : . (1.16) 

11 (T ' n> o n>0 


The modularity of i] and (1.13) yields that we have a mixed mock modular form. 

A third example comes from Lie superalgebras. Recently, Kac and Wakimoto 
[28] found a character formula for certain si(tn\ l) A -modules. In terms of ^-series 
this can be written in the following way (s e Z): 


2 q 


S_ 

2 




E 

k=(ki,k 2 ,-,km -\)£% m_1 


q ?Z7= i 1 ki(k i + 1) 

E m — 1 / „ ' 

i = 1 5 


From this representation it is not clear at all that one has mixed mock modular forms, 
but the author proved this in joint work with Ken Ono [14]. 

One can also consider specialized character formulas for irreducible highest 
weight si(m\n) A modules. Here an even more complicated class of functions plays 
a role [8], the so-called almost harmonic Maass forms. Loosely speaking, these are 
sums of harmonic Maass forms under iterates of the raising operator R * := 2i + * 
(thus themselves non-harmonic weak Maass forms) multiplied by almost holomor- 
phic modular forms [29], which are functions that transform like modular forms and 
are polynomials in 4 with (weakly) holomorphic coefficients. We call the associated 
holomorphic parts almost mock modular forms. 

There are many more interesting mixed mock modular forms, for example coming 
from certain generating function arising in the theory of black holes [18] or Joyce 
invariants. 

As a special case, we treat the coefficients otj (n) of fj defined in (1.16). For this, 
we require some more notation. We let for k e N, g e Z, and u el 


fk.g ( M ) 


sinh' 


2 / nit 




7tig_\ 

2k ) 

_ kl 

u 2 


if g ^ 0 (mod 2k), 
if g = 0 (mod 2k). 
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Furthermore, we define the Kloosterman sums 

X—X , 2j ri ( k ,, I h'm f 

Kj y i(n,m\k):= i jrji{h,h ,k)e k ' 

0 <h<k 
(h,k)=l 

where h' is defined by hh' = — 1 (mod k) and where the il/ji are certain explicit 
multipliers. Finally, we let 

r. 1 7 

lk, g {n) := J ^ fk, g (|) 1 7 - O' + 1))(1 -m 2 )] (1 - tt 2 ) 4 du. 


Theorem 5.1. For n > 1, the Fourier coefficients aj (//) of fj are given by the 
following exact formula: 


oij(n) = (4 n - O' + 1)) 4 ^2 


K J} o(n,0-,k ) nx 


k> 1 


(^V 4 " ~~ + L) ) 


1 ./ 77 


+ - 7 = (4« - 0 + 1)) 2 X! 
V2 *>1 




/ 3 (jV 4 « - 0 + 1)) 


- ^ ( 4 « ~ O' + !)) * I] I] 


A>1 fe{0,l} 

—k<g<k 

g=l (mod 2) 


Kj,i(n,g 2 \k) 

k 2 


d-k,g(ft )■ 


The above formula is useful as it allows one to determine asymptotic expansions. 
To be more precise, one can show (see [10, 11]) the following. 


Corollary 5.2. The leading asymptotic terms of aj (n) for n —> 00 are 


a jin ) = I —n 2 -« l + O (n 2 )V 

J J \96 327r v ’) 




We note that in contrast to mock modular forms, the shadows of the mixed mock 
modular forms do contribute to the leading asymptotic terms. One could determine 
further polynomial lower order main terms. 

Proof of Theorem 5.1. We only give a sketch of a proof. Details can be found in [11]. 
From (1.13) and the transformation law of the theta function it follows that 


fj 


( 1 . . . \ 3 -x—-v . nih! jri(J + \)h 

-(It + iz) J = 2 _, tyjt(h,h ,k)e 2 * 2 A 



—6 


4^277 




DX))- 
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where 


lj(x) 


-f 




dw. 


(w + x)z 

Using partial fraction decomposition, we obtain 

g 2 h'\ ( 28 0 „ 


J2 e (~ : 

CT (mnrl '■ 


g (mod 2k ) 
g=j (mod 2) 


4 k 


1 


sfx sflnk 2 


£ 


X J — oo 


e 2 k xu f kg (^ li 'j C J u 


where <5o,g = 0 unless g = 0 (mod 2k ) in which case it equals 1 . 

Now the main difference compared to the use of the classical Circle Method is the 
contribution of integrals of the form 


2nb 5 _2nu- 

Zk,g,b( z ) ■= z2 / e k: Jk,g(u)du. 

J —oo 

Similarly to the case of Fourier expansions, we are now interested in their “principal 
parts”. To be more precise, we let for b > 0 and g e Z 

2 nb 5 2 ttm 2 

Jk,g,b(z) ■= e kz zz / e kz f kg ( u )du. 

J-Vb 


One may show the following asymptotic bounds for k < g < k: 


Lemma 5.3. 

(i) Ifb< 0 , then 


Ik,g,b( z )\ < \ z \^ 


( kf 

\g 2 

(1 


ifg # 0, 
ifg = 0. 


(ii) lfb> 0 , then 


dJ-k,g,b(z) — Jk,g,b{z) T £/c,g,&(z) 

with £k,g,b satisfying the bound in (i). 


With this data one may then use the classical Circle Method. 


□ 
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1 Representation of real numbers 

The most classical ways to represent real numbers are by means of their continued 
fraction expansion or their expansion in some integer base, in particular in base two 
or ten. In this text, we consider only these expansions and deliberately ignore /?- 
expansions, Liiroth expansions, Q -Cantor series, etc., as well as the many variations 
of the continued fraction algorithm. 

The first example of a transcendental number (recall that a real number is alge¬ 
braic if it is a root of a nonzero polynomial with integer coefficients and it is tran¬ 
scendental otherwise) was given by Liouville [51, 52] in 1844. He showed that if the 
sequence of partial quotients of an irrational real number grows sufficiently rapidly, 
then this number is transcendental. He mentioned only at the very end of his note the 
now classical example of the series (keeping his notation) 

i | 1 | i i 1 

a + a^ + a^ + ‘" + a l - 2 - i - m + ‘“’ 


where a > 2 is an integer. 
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Let b denote an integer at least equal to 2. Any real number f has a unique 6-ary 
expansion, that is, it can be uniquely written as 


f = LfJ + £ 

l> t 


L£J + 0 • ai«2 ■ • •, 


( 2 . 1 ) 


where [-J denotes the integer part function, the digits ai, 02 ,... are integers from the 

set {0, 1. b — 1} and ctf differs from b — 1 for infinitely many indices l. This 

notation will be kept throughout this text. 

In a seminal paper published in 1909, Entile Borel [21] introduced the notion of 
normal number. 


Definition 1.1. Let b > 2 be an integer. Let £ be a real number whose b -ary expan¬ 
sion is given by (2.1). We say that f is normal to base b if, for every k > 1, every 

finite block of k digits in { 0,1 . b — 1 } occurs with the same frequency 1 /b k , that 

is, if for every k > 1 and every d\ . dk £ { 0 , 1 , — 1 } we have 

#{l : 0 < l < N,ai +1 = dx . a i+k = d k } 1 

lrm - = — 

N^+oo N b k 

The above definition differs from that given by Borel, but is equivalent to it; see 
Chapter 4 of [27] for a proof and further equivalent definitions. 

We reproduce the fundamental theorem proved by Borel in [21]. Throughout 
this text, ‘almost all’ always refers to the Lebesgue measure, unless otherwise speci¬ 
fied. 

Theorem 1.2. Almost all real numbers are normal to every integer base h > 2. 

Despite the fact that normality is a property shared by almost all numbers, we 
do not know a single explicit example of a number normal to every integer base, let 
alone of a number normal to base 2 and to base 3. However, Martin [54] gave in 2001 
a nice and simple explicit construction of a real number normal to no integer base. 

The first explicit example of a real number normal to a given base was given by 
Champernowne [34] in 1933. 

Theorem 1.3. The real number 

0-12345678910111213..., (2.2) 

whose sequence of decimals is the increasing sequence of all positive integers, is 
normal to base ten. 


Further examples also obtained by concatenation of sequences of integers have 
been given subsequently in [35, 37]. In particular, the real number 


0-235711131719232931.. 


(2.3) 



2 Expansions of algebraic numbers 


33 


whose sequence of decimals is the increasing sequence of all prime numbers, is nor¬ 
mal to base ten. This is due to the fact that the sequence of prime numbers does not 
increase too rapidly. However, we still do not know whether the real numbers (2.2) 
and (2.3) are normal to base two. 

Constructions of a completely different type were found by Stoneham [66] and 
Korobov [49]; see Bailey and Crandall [18] for a more general statement which in¬ 
cludes the next theorem. 


Theorem 1.4. Let b and c be coprime integers, both at least equal to 2. Let d > 2 
be an integer. Then, the real numbers 


£ 


1 

cJb cJ 


and 


E 


1 

c d J h c dJ 


are normal to base b. 


Regarding continued fraction expansions, we can as well define a notion of nor¬ 
mal continued fraction expansion using the Gauss measure (see Section 5) and prove 
that the continued fraction expansion 

a = \_a\ + [0;ai,a 2 -] = [_aj H-—-- 


«2 H- 

of almost every real number a is a normal continued fraction expansion. Here, the 
positive integers a\, a 2 ,... are called the partial quotients of a (throughout this text, 
ai denotes either a b -ary digit or a partial quotient, but this should be clear from 
the context; furthermore, we write f for a real number when we consider its b -ary 
expansion and a when we study its continued fraction expansion). In 1981, Adler, 
Keane, and Smorodinsky [13] have constructed a normal 

continued fraction in a similar way as Champernowne did for a normal number. 

Theorem 1.5. Let 1/2,1/3,2/3. 1/4,2/4, 3/4,... be the infinite sequence obtained 
in writing the rational numbers in (0, 1) with denominator 2 , then with denominator 
3, denominator 4, etc., ordered with numerators increasing. Let x \ X 2 X 3 ... be the 
sequence of positive integers constructed by concatenating the partial quotients (we 
choose the continued fraction expansion which does not end with the digit 1) of this 
sequence of rational numbers. Then, the real number 

[0;xi,x 2 ,...] = [0;2,3. 1,2,4,2,1,3,5,...] 

has a normal continued fraction expansion. 

All this shows that the b -ary expansion and the continued fraction expansion of 
a real number taken at random are well understood. But what can be said for a specific 
number, like 1 / 2 , log 2, n, etc.? 
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Actually, not much! We focus our attention on algebraic numbers. Clearly, a real 
number is rational if, and only if, its b -ary expansion is ultimately periodic. Analo¬ 
gously, a real number is quadratic if, and only if, its continued fraction expansion is 
ultimately periodic; see Section 5. The purpose of the present text is to gather what is 
known on the b -ary expansion of an irrational algebraic number and on the continued 
fraction expansion of an algebraic number of degree at least three. It is generally 
believed that all these expansions are normal, and some numerical computation tend 
to support this guess, but we are very, very far from proving such a strong assertion. 
We still do not know whether there is an integer b > 3 such that at least three dif¬ 
ferent digits occur infinitely often in the b -ary expansion of %/2. And whether there 
exist algebraic numbers of degree at least three whose sequence of partial quotients 
is bounded. Actually, it is widely believed that algebraic numbers should share most 
of the properties of almost all real numbers. This is indeed the case from the point 
of view of rational approximation, since Roth’s theorem (see Section 6) asserts that 
algebraic irrational numbers do behave like almost all numbers, in the sense that they 
cannot be approximated by rational numbers at an order greater than 2. 

The present text is organized as follows. Section 2 contains basic results from 
combinatorics on words. The main results on the complexity of algebraic numbers 
are stated in Section 3. They are proved by combining combinatorial transcendence 
criteria given in Section 4 and established in Sections 9 and 10 with a combinatorial 
lemma proved in Section 8. In Sections 5 and 6 we present various auxiliary results 
from the theory of continued fractions and from Diophantine approximation, respec¬ 
tively. Section 7 is devoted to a sketch of the proof of Theorem 4.1 and to a short 
historical discussion. We present in Section 11 another combinatorial transcendence 
criterion for continued fraction expansions, along with its proof. Section 12 briefly 
surveys some refined results which complement Theorem 3.1. Finally, in Section 13, 
we discuss other points of view for measuring the complexity of the b -ary expansion 
of a number. 

A proof of Theorem 4.1 can already be found in the surveys [20, 8] and in the 
monograph [27]. Here, we provide two different proofs. Historical remarks and 
discussion on the various results which have ultimately led to Theorem 4.2 are given 
in [30], 


2 Combinatorics on words and complexity 

In the sequel, we often identify a real number with the infinite sequence of its b- 
ary digits or of its partial quotients. It appears to be convenient to use the point of 
view from combinatorics on words. Throughout, we denote by A a finite or infinite 
set. A finite word over the alphabet A is either the empty word, or a finite string 
(or block) of elements from A. An infinite word over A is an infinite sequence of 
elements from A. 
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For an infinite word w = w\ w; 2 ... over the alphabet A and for any positive 
integer n, we let 

p(n,w,A) := #{w/+i . ..w j+n : j > 0} 

denote the number of distinct strings (or blocks) of length n occurring in w. Obvi¬ 
ously, putting #A — +oo if A is infinite, we have 

1 < pin, w, A) < (#A) n , 

and both inequalities are sharp. Furthermore, the function n i-> p(n, w, A) is non¬ 
decreasing. 

Definition 2.1. An infinite word w = W \ w 2 ... is ultimately periodic if there exist 
positive integers 77 o and T such that 

w„+t — w„, for every 7? > 7?o. 

The word w„ 0 w„ 0 +i ... w„ 0 +t-i is a period of w. If 77o can be chosen equal to 1, 
then w is (purely) periodic, otherwise, w\ ... w no -\ is a preperiod of w. 

We establish a seminal result from Morse and Hedlund [55, 56]. 

Theorem 2.2. Let w be an infinite word over a finite or infinite alphabet A. // w is 
ultimately periodic, then there exists a positive constant C such that p(n. w. A) < C 
for every positive integer n. Otherwise, we have 

p{n + l,w, .A) > pln.'W.A) + 1 for every n > 1, 


thus, 

p(n , w, A) > n + 1 for every n > 1 . 

Proof Throughout the proof, we write p(-, w) instead of p(-, w, A). 

Let w be an ultimately periodic infinite word, and assume that it has a preperiod 
of length r and a period of length s. Fix h = 1 ,..., s and let n be a positive integer. 
For every j > 1 , the block of length n starting at w r +j S +h is the same as the one 
starting at w r+ h. Consequently, there cannot be more than r + s distinct blocks of 
length77, thus, p{n,yv) < r + 5 . 

Write w = w \ W 2 ■ ■ ■ and assume that there is a positive integer 77 0 such 
that p(n 0, w) = p (77o + 1, w). This means that every block of length 77 0 extends 
uniquely to a block of length no + 1. It implies that p(no, w) = p{no + j , w) holds 
for every positive integer j. By the definition of pijU). vv), two among the words 
Wj ... w no+ j-\, j — 1 ,..., p{tio, w) + 1 , are the same. Consequently, there are 

integers k and i with 0 < k < l < p(no,w) and Wk +m — Wi +m for 777 = 1 . no- 

Since every block of length 77 0 extends uniquely to a block of length 77 0 + 1 , this 
gives Wk+m — u; t.+ m for every positive integer m. This proves that the word w is 
ultimately periodic. 
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Consequently, if w is not ultimately periodic, then p(n + l,w) > p(n, w) + 1 
holds for every positive integer n. Then, p{\, w) > 2 and an immediate induction 
show that p(n , w) > n + 1 for every n. The proof of the theorem is complete. □ 

We complement Theorem 2.2 by pointing out that there exist uncountably many 
infinite words w over A = {0, 1} such that 

p(n,w,A) = n + 1, for n > 1 . 

These words are called Sturmian words ; see e.g. [16]. 

To prove that a real number is normal to some given integer base, or has a normal 
continued fraction expansion, is in most cases a far too difficult problem. So we are 
led to consider weaker questions on the sequence of digits (resp. partial quotients), 
including the following ones: 

• Does every digit occur infinitely many times in the b -ary expansion of £? 

• Are there many non-zero digits in the b -ary expansion of £? 

• Is the sequence of partial quotients of a bounded from above? 

• Does the sequence of partial quotients of a tend to infinity? 

We may even consider weaker questions, namely, and this is the point of view 
we adopt until the very last sections, we wish to bound from below the number of 
different blocks in the infinite word composed of the digits of £ (resp. partial quotients 
of O'). 

Let b > 2 be an integer. A natural way to measure the complexity of a real number 
£ whose b -ary expansion is given by (2.1) is to count the number of distinct blocks of 
given length in the infinite word a = aia 2 a 3 ... We set p(n ,%,b) = p(n , a, b), with 
a as above. Clearly, we have 

p(n, b) = #{a j+l a j+2 ...a j+n : j > 0} = p(n, a, {0, 1,..., b - 1}) 


and 

1 < p(n, £, b) < b n , 
where both inequalities are sharp. 

Since the b- ary expansion of a real number is ultimately periodic if, and only if, 
this number is rational, Theorem 2.2 can be restated as follows. 

Theorem 2.3. Let b > 2 be an integer. If the real number £ is irrational, then 

p(n,%,b) > n + 1, for n > 1. 

Otherwise, the sequence ( p(n , £, b)) n > \ is bounded. 

Let a be an irrational real number and write 


a. — [a.\ + [0\ai,a 2 ,...]. 
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Let a denote the infinite word ci\a 2 ■ ■ ■ over the alphabet Z> |. A natural way to 
measure the intrinsic complexity of a is to count the number p(n. a) : = p(n , a, Z> |) 
of distinct blocks of given length n in the word a. 

Since the continued fraction expansion of a real number is ultimately periodic if, 
and only if, this number is quadratic (see Theorem 5.7), Theorem 2.2 can be restated 
as follows. 

Theorem 2.4. Let b > 2 be an integer. If the real number a is irrational and not 
quadratic, then 

p{n, a) > n + 1, for n > 1. 

If the real number a is quadratic, then the sequence {p{n ,a.)) n >\ is bounded. 

We show in the next section that Theorem 2.3 (resp. 2.4) can be improved when 
£ (resp. a) is assumed to be algebraic. 


3 Complexity of algebraic numbers 


As already mentioned, we focus on the digital expansions and on the continued frac¬ 
tion expansion of algebraic numbers. Until the end of the 20th century, it was only 
known that the sequence of partial quotients of an algebraic number cannot grow too 
rapidly; see Section 12. Regarding b- ary expansions, Ferenczi and Mauduit [43] were 
the first to improve the (trivial) lower bound given by Theorem 2.3 for the complexity 
function of the b- ary expansion of an irrational 

algebraic number 6. They showed in 1997 that pin, 9, b) strictly exceeds n + 1 
for every sufficiently large integer n. Actually, as pointed out a few years later by 
Allouche [14], their approach combined with a combinatorial result of Cassaigne 
[32] yields a slightly stronger result, namely that 

lim (p(n, 9, b) — n) = +oo, (2.4) 

n^+oo v 

for any algebraic irrational number 9. 

The estimate (2.4) follows from a good understanding of the combinatorial struc¬ 
ture of Sturmian sequences combined with a combinatorial translation of Ridout’s 
theorem 6.6. The transcendence criterion given in Theorem 4.1 , established in [10, 3], 
yields an improvement of (2.4). 


Theorem 3.1. For any irrational algebraic number 9 and any integer b > 2, we have 


n 


lim 

->+oo 


pin, 9, b) 

-= +oo. 

n 


(2.5) 


Although (2.5) considerably strengthens (2.4), it is still very far from what is 
commonly expected, that is, from confirming that pin, 9, b) — b" holds for every 
positive n when 9 is algebraic irrational. 
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Regarding continued fraction expansions, it was proved in [15] that 
lim (p(n, 9) — n) — +oo, 

W—>- + 0O 

for any algebraic number 9 of degree at least three. This is the continued fraction 
analogue of (2.4). 

Using ideas from [1], the continued fraction analogue of Theorem 3.1 was estab¬ 
lished in [29]. 


Theorem 3.2. For any algebraic number 9 of degree at least three, we have 


n 


lim 

—>•+00 


pin,9) 

-= +oo. 

n 


( 2 . 6 ) 


The main purpose of the present text is to give complete (if one admits Theo¬ 
rem 6.7, whose proof is much too long and involved to be included here) proofs of 
Theorems 3.1 and 3.2. They are established by combining combinatorial transcen¬ 
dence criteria (Theorems 4.1 and 4.2) and a combinatorial lemma (Lemma 8.1). This 
is explained in details at the end of Section 8 . 


4 Combinatorial transcendence criteria 

In this section, we state the combinatorial transcendence criteria which, combined 
with the combinatorial lemma from Section 8, yield Theorems 3.1 and 3.2. 

Throughout, the length of a finite word W over the alphabet A, that is, the number 
of letters composing W, is denoted by \ W\. 

Let a = (be a sequence of elements from A. We say that a satisfies 
Condition (4k) if a is not 

ultimately periodic and if there exist three sequences of finite words, (U n ) n >i, 
(V„)n> i, and (W n )n> t, such that: 

(i) For every n > 1, the word W n U n V n U n is a prefix of the word a. 

(ii) The sequence (| V n \/\U n ) n > i is bounded from above. 

(iii) The sequence (| W n \/\U n \) n >\ is bounded from above. 

(iv) The sequence (\U n \) n >\ is increasing. 

Theorem 4.1. Let b > 2 be an integer. Let a = (af)i> \ be a sequence of elements 
from {0, 1, .... b — 1}. If a satisfies Condition (4k), then the real number 

+oo 

* ■ ^ b* 

1 =1 


is transcendental. 
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Theorem 4.2. Let a = a sequence of positive integers. Let ( Pi/qi)i>\ 

denote the sequence ofconvergents to the real number 

a [0; a\, a 2 , • • ■, at ,...]. 

Assume that the sequence (q f )t> i is bounded. If a satisfies Condition (4fc), then ot 
is transcendental. 

The common tool for the proofs of Theorems 4.1 and 4.2 is a powerful theorem 
from Diophantine approximation theory, the Subspace Theorem; see Section 6. 

Let us comment on Condition (♦) when, for simplicity, the alphabet A is finite 

and has b >2 elements. Take an arbitrary infinite word a \ 02 ... on {0,1. b — 1}. 

Then, by the Schubfachprinzip, for every positive integer m, there exists (at least) one 
finite word U m of length in having (at least) two (possibly overlapping) occurrences in 
the prefix a\U 2 ■ ■ ■ ai,m +m . If, for simplicity, we suppose that these two occurrences 
do not overlap, then there exist finite (or empty) words V m . W m , X m such that 

a\ Ci2 ...ci}jin — W m U m V m U m X m . 

This simple argument gives no additional information on the lengths of W m and V m , 
which a priori can be as large as b m — m. In particular, they can be larger than some 
constant greater than 1 raised to the power the length of U m . 

We demand much more for a sequence a to satisfy Condition (4fr), namely we 
impose that there exists an integer C such that, for infinitely many m , the lengths of 
V m and W m do not exceed C times the length of U m . Such a condition occurs quite 
rarely. 

We end this section with a few comments on de Bruijn words. 

Definition 4.3. Let b >2 and n > 1 be integers. A de Bruijn word of order n over 
an alphabet of cardinality b is a word of length b n + n — 1 in which every block of 
length n occurs exactly once. 

A recent result of Becher and Heiber [19] shows that we can extend de Bruijn 
words. 

Theorem 4.4. Every de Bruijn word of order n over an alphabet with at least three 
letters can be extended to a de Bruijn word of order n + 1. Every de Bruijn word 
of order n over an alphabet with two letters can be extended to a de Bruijn word of 
order n + 2 . 

Theorem 4.4 shows that there exist infinite de Bruijn words obtained as the induc¬ 
tive limit of extended de Bruijn sequences of order n, for each n (when the alphabet 
has at least three letters; for each even n, otherwise). Let b > 2 be an integer. By 
construction, for every m > 1, the shortest prefix of an infinite de Bruijn word having 
two occurrences of a same word of length m has at least b m + m letters if h > 3 and 
at least 2" ! “ 1 + m — 1 letters if b = 2. 
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5 Continued fractions 


In this section, we briefly present classical results on continued fractions which will 
be used in the proofs of Theorems 4.2 and 11.1. We omit most of the proofs and 
refer the reader to a text of van der Poorten [58] and to the books of Bugeaud [22], 
Cassels [33], Hardy and Wright [44], Khintchine [47], Perron [57], and Schmidt [64], 
among many others. 

Let xo, Xi ,... be real numbers with xi, X 2 , ■ ■ . positive. A finite continued frac¬ 
tion denotes any expression of the form 

[x 0 ;xi,x 2 , — x„\ = x 0 H- —- -. 


x 2 H-j“~ 

• • • + — 
X n 


We call any expression of the above form or of the form 


[x 0 ; xi, x 2 ,...] = x 0 + 


1 


X\ + 


1 


1 

X 2 H- 


lint [x 0 ;xi,x 2 ,...,x„] 

n —»-+oo 


a continued fraction, provided that the limit exists. 

Any rational number r has exactly two different continued fraction expansions. 
These are [r] and [r — 1; 1] if r is an integer and, otherwise, one of them reads 
[«o; a \,..., a n -i, a n \ with a n > 2 , and the other one is [ao\ a \,..., a n -i,a n — 1 , 1 ]. 
Any irrational number has a unique expansion in continued fraction. 


Theorem 5.1. Let a — [a$\ai,a 2 , ■ ■ ■] be an irrational number. For l > 1, set 
Pllqi [ciq\ a\, a 2 ,..., af\. Let n be a positive integer. Putting 

P-\ = 1, q-i =0, Po = a o- and qo = 1, 


we have 


and 


Pn — O-nPn— 1 T Pn— 2> dn — a n q n — \ T qn— 2 . 


Pn—ldn Pndn—l — ( 1 ) ■ 

Furthermore, setting a n +\ — [a n + \; n„+ 2 , a n+ j ,...], we have 

PnUn + \ + p n — 1 


a = [a 0 \a \,..., a„,a n +i\ = 


dn&n + l T dn —1 


(2.7) 

( 2 . 8 ) 


(2.9) 


(- 1 )" 

q„a - Pn = -;-, 

QnWn+l \ Qn— 1 


thus 
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and 


1 i 

(a n +\ + 2 )ql q n (q n + q n + 1 ) 


< 


0 ! — 


Pn 

Qn 


1 1 1 

< - < -2 < -2- (2-10) 

dndn+1 fln+ldn Qn 


It follows from (2.9) that any real number whose first partial quotients are 
ao, ci \,..., a n belongs to the interval with endpoints ( p n + p n ~i)/(q n + Qn- 1 ) and 
p n / q n . Consequently, we get from (2.8) an upper bound for the distance between two 
real numbers having the same first partial quotients. 


Corollary 5.2. Let a = [aol ci\,a 2 , ■ ■.] be an irrational number. For l > 0, let qi be 
the denominator of the rational number [ao\ fli, a 2 , • • •. tif]. Let n be a positive inte¬ 
ger and f be a real number such that the first partial quotients of ft are ao, a\,..., a n . 
Then, 


qn(qn+q n - 1 ) q„' 

Under the assumption of Theorem 5.1, the rational number pt/qt is called the 
i-th convergent to a. It follows from (2.7) that the sequence of denominators of 
convergents grows at least exponentially fast. 

Theorem 5.3. Let a = [a o; a\. (i 2 ,.. ■] be an irrational number. For t > 0, let qi be 
the denominator of the rational number [ao', a 1 , r/ 2 • • ■ •. af\. For any positive integers 
l, h, we have 

qt+h > qi{sfl) h 1 

and 

qt < (1 +max{a 1 ,...,fl£}) £ . 


Proof. The first assertion follows via induction on h, since q n+ 2 > q„ +1 + q n > 2 q n 
for every n > 0. The second assertion is an immediate consequence of (2.7). □ 

The next result is sometimes called the mirror formula. 


Theorem 5.4. Let n >2 be an integer and ci \,... ,a n be positive integers. For 
l — 1, . . . ,n, set pi/qi = [0; a\,.. ., af\. Then, 

qn-l rn , 

-= [0; a n ,a n -\,... ,a\], 

Qn 

Proof. We get from (2.7) that 


Qn Qn—2 

-= a n H-, 

qn —1 qn —1 


for n > 1. The theorem then follows by induction. □ 
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An alternative proof of Theorem 5.4 goes as follows. Observe that, if a 0 = 0 and 
/z > 1, then, by (2.7), 



Taking the transpose, we immediately get that 



Yo fV / 0 i y..Y° 1 ^ 

~ \l a n ) \ 1 a n - 1 j \1 cn) 

= [0 1 \ fO 1 \ fO 1 \ = (p n - x q n -A 

\1 a n )\l a n - 1 J"\l \ p n q n )’ 

which gives Theorem 5.4. 

Theorem 5.4 is a particular case of a more general result, which we state below 
after introducing the notion of continuant. 


Definition 5.5. Let m > 1 and a \ . a m be positive integers. The denominator 

of the rational number [0; ai ,..., a m ] is called the continuant of a\,...,a m and is 
usually denoted by K m (a i,..., a m ). 


Theorem 5.6. For any positive integers a \ . a m and any integer k with 1 < k < 

m — 1, vve have 

Km, a m ) = K m ( (ini,... ,a i), (2.11) 


and 


Kk (tii ,.... tlk) • K m —k {tlk + 1. • • ■ j ti m ) 

— K m (a i ,..., a m ) (2.12) 

2 Kk(ai,... ,dk) ■ K m —k(ak+ 1 , ■ ■ ■,ct m ). 


Proof. The first statement is an immediate consequence of Theorem 5.4. Combining 
Km(ti\,..., a m ) = a m K m —\ (a \,..., a m —\) + K m — 2 (a,\,..., ci m — 2 ) 
with (2.11), we get 

Km (til, .... a m ) = tl\ K m ~ 1 ( 02 , ■ ■ ■ , tlm) ~\~ K m ~ 2 ( 773 ,. . . , tl m ), 

which implies (2.12) for k = 1. Let k e {1,2,..., m — 2} be such that 

Km ■ — K m (iz 1 ,..., a m ) 

— Kk(ai ,..., af)' K m -k(ak+i, ■ ■ ■, ti m ) 

T Kk —1 (a 1 ,..., ctk—i )' K m —k—i(ak+ 2 i . . . , Clfpi), 


(2.13) 
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where we have set Kq = \. We then have 

K m — Kk(ci\, . . . , O/c) ■ (cik-\-\K m —k—l{Ok+2^ ■ • ■ ! O m ) + K m —k—2(Ok + 3 ,..., a m )) 

T Kk— i (r?i, ■ • ■, Ok— i) • K m —k ~i (Ok+ 2’ • • • >^m) 

= (^fc+1 K-k 9 • • • > “b l(^l> • • • 5 l)) * K m —k —1 • • • > ^m) 

“1“ (#1 » • • • » &k) * K m —k— 2(^fc+3> ...,0m), 

giving (2.13) for the index k + 1. This shows that (2.13) and, a fortiori, (2.12) hold 
for k = 1,. .., m — 1 . □ 

The ‘only if’ part of the next theorem is due to Euler [39], and the ‘if’ part was 
established by Lagrange [50] in 1770. 

Theorem 5.7. The real irrational number a = [r/ol a\, 02 , ■ ■ ■] has a periodic con¬ 
tinued fraction expansion (that is, there exist integers r > 0 and s > 1 such that 
a n + s = a n for all integers n > r + 1) if and only if a is a quadratic irrationality. 

We display an elementary result on ultimately periodic continued fraction expan¬ 
sions. 


Lemma 5.8. Let 9 be a quadratic real number with ultimately periodic continued 
fraction expansion 

6 = [0; fli,..., a r ,a r +i, ■ ■ ■, a r + s \, 

and denote by ( pt/qi)t>\ the sequence of its convergents. Then, 6 is a root of the 
polynomial 


idr—ldr+s Qrdr+s—l)X (dr—l Pr+s drPr+s—X T Pr—ldr+s 

+ (Pr-lPr+s ~ PrPr+s- 1 ). 


p r q r +s-i)X 

(2.14) 


Proof It follows from (2.9) that 


9 — [0, a i ,..., a r , 0 r -|_i] 


PrO' + Pr -1 _ Pr+sO' + Pr+s -1 
q r 9 T- q r — i dr+s9' T dr+s—i 


where 6' = [a r +T, a r + 2 ,..., a r + s , a r + 1 ]. Consequently, we get 


q/ Pr —1 d_r—\9 Pr+s— 1 dr+s— 

q r 9 - p r q r+s 9 - p r+s 


from which we obtain 


(Pr —1 Qr— \9)(qr+s9 Pr+s) — (Pr+s —1 dr+s— 1 9) {q r 9 Pr)- 


This shows that 9 is a root of (2.14). 


□ 
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We do not claim that the polynomial in (2.14) is the minimal polynomial of 9 over 
the integers. This is indeed not always true, since its coefficients may have common 
prime factors. 

The sequence of partial quotients of an irrational real number a in (0,1) can be 
obtained by iterations of the Gauss map Tg defined by Tg (0) = 0 and Tg (x) = {1 / x) 
for x e (0, 1). Namely, if [0; a\, aj, ■ ■ ■] denotes the continued fraction expansion of 
a, then Tg(a) = [0 \a n+l ,a n+2 , ...] anda„ = [1 /Tg~ l (a)\ for n > 1. 

In the sequel, it is understood that a is a real number in (0,1), whose partial quo¬ 
tients fli(o'), 02 ( 01 ),. .. and convergents p\{a)/q\{a). P 2 {ot) / q 2 {ot),... are written 
a\, a 2 , ■ ■ ■ and p\/q\. Pilq?., ■ ■ respectively, when there is no danger of confusion. 

The map Tg possesses an invariant ergodic probability measure, namely the Gauss 
measure jig, which is absolutely continuous with respect to the Lebesgue measure, 
with density 

dx 

Tgidx) = tg ;—77—V 
(1 + x) log 2 


For every function / in L 1 (p.g) and almost every a. in (0,1), we have (Theorem 3.5.1 
in [36]) 


n 


lim — 

^►+00 n 


E ATja) 


k=0 


— r 

log 2 Jo 


fix) 

1 T x 


dx. 


(2.15) 


For subsequent results in the metric theory of continued fractions, we refer the reader 
to [47] and to [36], 


Definition 5.9. We say that [0; ciy, 02 ,...] is a normal continued fraction, if, for every 
integer k > 1 and every positive integers d \,..., d^, we have 


lim 

N —>+00 


#{./ : 0 < j < N -k,a j+ 1 = d u ...,a j+k = d k } 


N 


= r J/s ' 

Jr/s 


(2.16) 


fig(dx) = flg( Ad u ...,d k ), 


where r/s and r'/s' denote the rational numbers [0: d\ . d k - \. dk] and [0; d \,..., 

d k -i,d k + 1], ordered so that r/s < r'/s', and A d\,...,d k = V/s, r'/s']. 


Let di,.. .,d k be positive integers. It follows from Theorem 5.1 that the set 
A di,...,d k °f rea l numbers a in (0, 1) whose first k partial quotients are d\,.... d k 
is an interval of length 1 /{qkidk + c tk-\ ))■ Applying (2.15) to the function / = 
l Ad dk , we get that for almost every a — [0; a\, a. 2 , ...] in (0,1) the limit defined 
in (2.16) exists and is equal to p>g(A di,...,d k )- Thus, we have established the following 
statement. 


Theorem 5.10. Almost every a in (0, 1) has a normal continued fraction expansion. 

The construction of [13], reproduced in [27], is flexible enough to produce many 
examples of real numbers with a normal continued fraction expansion. 
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6 Diophantine approximation 


In this section, we survey classical results on approximation of real (algebraic) num¬ 
bers by rational numbers. 

We emphasize one of the consequences of Theorem 5.1. 

Theorem 6.1. For every real irrational number there exist infinitely many rational 
numbers p/q with q > 1 and 

1 


Theorem 6.1 is often, and wrongly, attributed to Dirichlet, who proved in 1842 
a stronger result, namely that, under the assumption of Theorem 6.1 and for every 
integer Q > 1, there exist integers p. q with 1 < q < Q and |f - p/q\ < l/(qQ). 
Theorem 6.1 was proved long before 1842. 

An easy covering argument shows that, for almost all numbers, the exponent of q 
in Theorem 6.1 cannot be improved. 



Theorem 6.2. For every e > 0 and almost all real numbers there exist only finitely 
many rational numbers p/q with q > 1 and 




P 

q 


< 


,2+s ' 


(2.17) 


Proof. Without loss of generality, we may assume that £ is in (0,1). If there are 
infinitely many rational numbers p/q with q > 1 satisfying (2.17), then £ belongs to 
the limsup set 



1 




n (0, l). 


The Lebesgue measure of the latter set is, for every Q > 1, at most equal to 


E i 

q>Q 


2 

q 2+E ’ 


which is the tail of a convergent series and thus tends to 0 as Q tends to infinity. This 
proves the theorem. □ 


The case of algebraic numbers is of special interest and has a long history. First, 
we define the (naive) height of an algebraic number. 


Definition 6.3. Let 0 be an irrational, real algebraic number of degree d and let 
adX d + • • • + a\X + flo denotes its minimal polynomial over Z (that is, the integer 
polynomial of lowest positive degree, with coprime coefficients and positive leading 
coefficient, which vanishes at 6). Then, the height H(6) of 0 is defined by 


H(6) := max{|a 0 |, |«t|,..., |a<*|}. 
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We begin by a result of Liouville [5 1 , 52] proved in 1844, and alluded to in Sec¬ 
tion 1. 


Theorem 6.4. Let 9 be an irrational, real algebraic number of degree d and height 
almost H. Then, 



1 

d 2 H( 1 + \e\) d ~ 1 q d 


(2.18) 


for all rational numbers p/q with q > 1. 


Proof Inequality (2.18) is true when \9 — p/q\ > 1. Let p/q be a rational number 
satisfying 1 9 — p/q\ < 1. Denoting by P(X) the minimal defining polynomial of 
9 over Z, we have P(p/q) 0 and | q d P(p/q)\ > 1. By Rolle’s Theorem, there 
exists a real number t lying between 9 and p/q such that 

\P(p/q)\ = | P(0) - P(p/q)\ = \0- p/q\ x \P\t)\. 

Since 1 1 — 9 \ < 1 and 

\P'(t)\ < d 2 H( 1 + \9\) d ~\ 

the combination of these inequalities gives the theorem. □ 

Thue [67] established in 1909 the first significant improvement of Liouville’s re¬ 
sult. There was subsequent progress by Siegel, Dyson and Gelfond, until Roth [61] 
proved in 1955 that, as far as approximation by rational numbers is concerned, the 
irrational, real algebraic numbers do behave like almost all real numbers. 


Theorem 6.5. For every e > 0 and every irrational real algebraic number 9, there 
exist at most finitely many rational numbers p/q with q > 1 and 



1 


(2.19) 


For a prime number l and a non-zero rational number x, we set \x\t l~ u , 
where u e Z is the exponent of i in the prime decomposition of x. Furthermore, we 
set |0[£ = 0. The next theorem, proved by Ridout [59], extends Theorem 6.5. 


Theorem 6.6. Let S be a finite set of prime numbers. Let 9 be a real algebraic 
number. Let e be a positive real number. The inequality 


]~[ \pq\i -minjl, 
leS 1 


e-* 


< 


has only finitely many solutions in non-zero integers p, q. 


1 

2 + £ 


Theorem 6.5 is ineffective, in the sense that its proofs do not allow us to compute 
explicitly an integer qo such that (2.19) has no solution with q greater than q 0 . Nev¬ 
ertheless, we are able to bound explicitly the number of primitive solutions (that is, 











2 Expansions of algebraic numbers 


47 


of solutions in coprime integers p and q) to inequality (2.19). The first result in this 
direction was proved in 1955 by Davenport and Roth [38]; see the proof of Theorem 
12.1 for a recent estimate. 

The Schmidt Subspace Theorem [62, 63, 64] is a powerful multidimensional ex¬ 
tension of the Roth Theorem, with many outstanding applications [20, 26, 69]. We 
quote below a version of it which is suitable for our purpose, but the reader should 
keep in mind that there are more general formulations. 

Theorem 6.7. Let m >2 be an integer. Let S be a finite set of prime numbers. Let 
L i,oo,..., Lnj'Oc be m linearly independent linear forms with real algebraic coeffi¬ 
cients. For any prime l in S, let L\y, ..., L m i be m linearly independent linear 
forms with integer coefficients. Let s be a positive real number. Then, there exist 
an integer T and proper subspaces Si,..., St of Q m such that all the solutions 
x = (x \,..., x m ) in Z m to the inequality 

m m 

nn \Li,i(xi)\(, • | | |Tj ; oo(rk)| — (max{l, |xj|,..., |x m |}) (2.20) 

teS i =i i=i 

are contained in the union S i U ... U St- 

Let us briefly show how Roth’s theorem can be deduced from Theorem 6.7. Let 
6 be a real algebraic number and e be a positive real number. Consider the two 
independent linear forms OX — Y and X. Theorem 6.7 implies that there are integers 
T > 1 , jci, ..., xt, yi, ■ • ■, Yt with (xi, Vi) (0,0) for i = 1 ,..., T , such that, for 
every integer solution ( p , q) to 


\q\■ \q0 — p\ < \q\ E , 

there exists an integer k with 1 < k < T and XkP + ) ! kd — 0. If 0 is irrational, this 
means that there are only finitely many rational solutions to \6 — p/q\ < \q\~ 2 ~ e , 
which is Roth’s theorem. 

Note that (like Theorems 6.5 and 6.6) Theorem 6.7 is ineffective, in the sense that 
its proof does not yield an explicit upper bound for the height of the proper rational 
subspaces containing all the solutions to (2.20). Fortunately, Schmidt [65] was able 
to give an admissible value for the number T of subspaces; see [42] for a common 
generalization of Theorems 6.6 and 6.7, usually called the Quantitative Subspace 
Theorem, and [41] for the current state of the art. 


7 Sketch of proof and historical comments 

Before proving Theorems 3.1 and 3.2, we wish to highlight the main ideas and explain 
how weaker results can be deduced from various statements given in Section 6. We 
focus only on b -ary expansions. 
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The general idea goes as follows. Let us assume that (2.5) does not hold. Then, 
the sequence of digits of our real number satisfies a certain combinatorial property. 
And a suitable transcendence criterion prevents the sequence of digits of an irrational 
algebraic numbers to fulfill the same combinatorial property. 

Let us see how transcendence results listed in Section 6 apply to get a combina¬ 
torial transcendence criterion. We introduce some more notation. Let IT be a finite 
word. For a positive integer l, we write W i: for the word W ... W {l times repeated 
concatenation of the word W) and W°° for the infinite word constructed by concate¬ 
nation of infinitely many copies of W. More generally, for any positive real number 
x, we denote by W x the word W^W', where W' is the prefix of W of length 
[(x — |_A'J )| IL’Il ■ Here, [•] denotes the upper integer part function. In particular, we 
can write 

aabaaaabaaaa = (aabaa) 12 ^ 5 = ( aabaaaabaa) 6 ^ 5 = (aabaa) los10 . 

Let a = be a sequence of elements from A. Let w > 1 be a real number. 

We say that a satisfies Condition (4>) w if a is not 

ultimately periodic and if there exist two sequences of finite words (Z„)„> j, and 
(Wn) n > 1 such that: 

(i) For every n > 1, the word W n Z" ; is a prefix of the word a. 

(ii) The sequence (| W n \/\Z n \) n >\ is bounded from above. 

(iii) The sequence (|Z„|)„>i is increasing. 

We say that a satisfies Condition (4*)oo if it satisfies Condition (♦)„,, for every 
w > 1 . 

Our first result is an application of Theorem 6.4 providing a combinatorial condi¬ 
tion on the sequence b- ary expansion of a real number which ensures that this number 
is trancendental. 

Theorem 7.1. Let b > 2 be an integer. Let a = be a sequence of elements 

from JO, 1. b — 1}. If a satisfies Condition (4k)oo» then the real number 

+oo 

* ^ b l 

1 =1 


is transcendental. 

Proof. Let w > 1 be a real number. By assumption, there exist two sequences of 
finite words, (Z n )„>|, and ( W„) n >\, and an integer C, such that W n \ < C\Z n \ and 
W n Zf is a prefix of a for 77 > 1. Let n > 1 be an integer. In particular, £ is very 
close to the rational number whose b -ary expansion is the eventually periodic word 
W n Z^°. A rapid calculation shows that there is an integer p n such that 


b\ w n\(b\ z ”\ — 1) 
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and 

I l~ln\ 


I- 


Pn 


b I IE; | ( b \Z„ | — 


— b \W n \+ui\Z n \ 

~ ( b \W n | ( b \Z n \ _ n ) 


(\W n \ + w\Z n \)K\W n \ + \Z n \) 


Since \W n \ < C|Z„|, the quantity (| W n \ + w\Z n \) / (\W n \ + \ Z n \) is bounded from 
below by (C + w)/(C + 1). Consequently, we get 


I- 


Pn 


*(i 


1 


(C+iu)/(C +1) 


I \b\Wn\(b\Z n \ — 


( 2 . 21 ) 


Let d be the integer part of (C + in )/( C + 1). Since (2.21) holds for every n > 1, it 
follows from Theorem 6.4 that £ cannot be algebraic of degree < d — 1. Since w can 
be taken arbitrarily large, one deduces that £ must be transcendental. □ 


It is apparent from the proof of Theorem 7.1 that, if instead of Liouville’s theorem 
we use Roth’s (Theorem 6.5) or, even better, Ridout’s Theorem 6.6, then the assump¬ 
tions of Theorem 7.1 can be substantially weakened. Indeed, Roth’s theorem is suffi¬ 
cient to establish the transcendence of £ as soon as the exponent (C + w)/(C + 1) 
strictly exceeds 2, that is, if u; > 2 + C. We explain below how Ridout’s theorem 
yields a much better result. 


Theorem 7.2. Let b > 2 be an integer. Let a = \ be a sequence of elements 

from {0,1 — 1}. Let w >2 be a real number. If a satisfies Condition (4k) w , 

then the real number 


+00 

5 ht 


1= 1 


is transcendental. 


Proof. We keep the notation of the proof of Theorem 7.1, where it is shown that, for 
any n > 1, we have 


s- 


Pn 


b \ W n\(b\ Z n\ — 1) 


< 


1 


b \Wn\ + w\Z n \ ’ 


that is, 

b ~\Wn\ 


I- 


Pn 


b I ( b^ n — 1) 


< 


1 


— b2\W n \+w\Z„ | 


< 


(bl Wn \(bZn\ - 1)) 


(2\W n \ + w\Z n \)K\Wn\ + \Z n \) 


Observe that, for n > 1, 


2\W„\ + w\Z„\ 


(w — 2)\Z n \ 

I W n \ + \z n \ 


>2 + 


in — 2 
C + 1' 


\W n \ + \Z n \ 
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Hence, if we take for S the set of prime divisors of b and set s (w — 2)/(C + 1), 
we conclude that there are infinitely many rational numbers p/q such that 


n \pq\i ■ 

£eS 


min; 1, 


P ) 

q I 


1 

q^' 


By Theorem 6.6, this shows that f is transcendental, since w > 2. □ 


We postpone to Section 9 the proof that the conclusion of Theorem 7.2 remains 
true under the weaker assumption w > 1 (the reader can easily check that this corre¬ 
sponds exactly to Theorem 4.1 ). 


8 A combinatorial lemma 


The purpose of this section is to establish a combinatorial lemma which allows us to 
deduce Theorems 3.1 and 3.2 from Theorems 4.1 and 4.2. 


Lemma 8.1. Let w = w i w; 2 ... be an infinite word over a finite or an infinite alpha¬ 
bet A such that 


. p(n,w,A) 
liminl - < Too. 

n—>+oo 11 

Then, the word w satisfies Condition (4fc) defined in Section 4. 


Proof. By assumption, there exist an integer C >2 and an infinite set A f of positive 
integers such that 

p(n , w, A) < Cn , for every n in A f. (2.22) 

This implies, in particular, that w is written over a finite alphabet. 

Let n be in A f. By (2.22) and the Schubfachprinzip, there exists (at least) one block 
X n of length n having (at least) two occurrences in the prefix of length (C T 1)« of 
w. Thus, there are words W n , Wfi B n and B' n such that W n < W' n and 

Wl • • • W(C-\- 1);; = W n X n B n = W n X n B n . 

If | W n X n | < W' n |. then define V n by the equality W n X n V n = Wfi Observe that 

Wi .. ,W( C + 1)« = W n X n V n X n B' n (2.23) 


and 


\Vn\ + \W n \ ^ 

\Xn\ 


(2.24) 


Set U„ := X n . 
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If \ W^\ < \W n X n \, then, recalling that \ W n \ < \W^\, we define X' n by W' n = 
W n X' n . Since X n B n — X' n X n B' n and \X' n \ < \X n \, the word X' n is a strict prefix of 
X n and X n is the concatenation of at least two copies of X' n and a (possibly empty) 
prefix of X' n . Let t„ be the largest positive integer such that X n begins with 2 t„ copies 
of X' n . Observe that 

2t n \X' n \ + 2\X' n \ > \X' n X n \, 

thus 

n = |X„| < (2 t n + \)\X' n \ < 3t n \X' n \. 

Consequently, W n (X' n n ) 2 is a prefix of w such that 

\X;‘ n \>n /3 

and 

S < 3 --((C + 1)/? — 2\X' n ‘ n |) < 3C + 1. (2.25) 

I A n I 11 

Set U„ : = X' n tn and let V n be the empty word. 

It then follows from (2.23), (2.24), and (2.25) that, for every n in the infinite set 

Af, 

W n U n V n U n is a prefix of w 

with 

\Wn\ + \V n \ < (3C + 1) \Un\. 

This shows that w satisfies Condition (4jk). □ 

We are now in position to deduce Theorems 3.1 and 3.2 from Theorems 4.1 
and 4.2. 

Proof of Theorem 3.1. Let h > 2 be an integer and £ be an irrational real number. 
Assume that p(n , f h) does not tend to infinity with n. It then follows from Lemma 
8.1 that the 

infinite word composed of the digits of £ written in base b satisfies Condition (♦). 
Consequently, Theorem 4.1 asserts that £ cannot be algebraic. By contraposition, we 
get the theorem. □ 

Proof of Theorem 3.2. Let cr be a real number not algebraic of degree at most two. 
Assume that p{n , a) does not tend to infinity with n. It then follows from Lemma 8.1 
that the 

infinite word composed of the partial quotients of £ satisfies Condition (4k). Fur¬ 
thermore, p( 1, a) is finite, thus the sequence of partial quotients of a is bounded, say 
by M. It then follows from Theorem 5.3 that qi < (M + I ) (: for l > 1, hence the 
sequence i is bounded. Consequently, all the hypotheses of Theorem 4.2 are 

satisfied, and one concludes that a cannot be algebraic of degree at least three. By 
contraposition, we get the theorem. □ 
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9 Proof of Theorem 4.1 


We present two proofs of Theorem 4.1, which was originally established in [10]. 

Throughout this section, we set \ U n \ = u„, \ V n \ = v n and \ W n \ — w n , for n > 1. 
We assume that £ is algebraic and we derive a contradiction by a suitable application 
of Theorem 6.7. 


First proof. Let n > 1 be an integer. We observe that the real number f is quite close 
to the rational number whose b -ary expansion is the infinite word W n (U n V n )°°. 
Indeed, there exists an integer p n such that 


}jWn ( foUn+Vn _ J) 


and 


It-?!.!< 


1 


fow n +v n +2u„ ’ 

since £ and f , have the same hrst w n + v n + 2u n digits in their b-ary expansion. 
Consequently, we have 

| jjWn+Un + Vn^ _ JjW n ^ ^ | _ | JjWnQjUn + Vn _ j)| _ pn \ < Jj~Un 


Consider the three linearly independent linear forms with real algebraic coefficients 

L l , O0 {X l ,X 2 ,X 2 ) = X l , 

L 2 , oa {X l ,X 2 ,X i ) = X 2 , 

L3, 00 (X l ,X 2 ,X 3 ) = $X 1 -$X 2 -X 3 . 

Evaluating them on the integer points x„ := (/;>'-'«+««+>’« j j Pn f we get that 

n l^,oc(x n )| <b 2wM . (2.26) 

1 < 7'<3 

For any prime number l dividing b, consider the three linearly independent linear 
forms with integer coefficients 

L U {X 1 ,X 2 ,X 3 ) = X 1 , 

L U (X i,X 2 ,X 3 ) = X 2 , 

L 3 ,e(XuX 2 ,X 3 ) = X 3 . 

We get that 

n n I Lj, e (x„)U<b- 2w "- u "- v ". (2.27) 

l\b 1 < 7'<3 

Since a satisfies Condition (4fr), we have 

u n 

liminf--- > 0. 

n ^"Too W n T U n T Vn 
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It then follows from (2.26) and (2.27) that there exists s > 0 such that 

n i^wi-n n ^ 

l</'<3 t\b 1<7<3 

< ma x{b Wn+Un+v \b w \ p n }~ E , 


for every n > 1 . 

Now we infer from Theorem 6.7 that all the points x„ lie in a finite number of 
proper subspaces of Q 3 . Thus, there exist a non-zero integer triple (zj , z 2 , zf) and an 
infinite set of distinct positive integers A/"i such that 

Zib w„+u n +v n + Z2b w„ +Z3Pti = o, (2.28) 

for any n in M\. 

Dividing (2.28) by b Wn+Un+Vn , we get 

2 > + = 0. (2.29) 

Since u n tends to infinity with n. the sequence (p n /b Wn+Un+Vn ) n > 1 tends to Let¬ 
ting /? tend to infinity along Af\, we then infer from (2.29) that either £ is rational, or 
Z\ — Z 3 = 0. In the latter case, z 2 must be zero, a contradiction. This shows that f 
cannot be algebraic. □ 

Second proof. Here, we follow an alternative approach presented in [2]. 

Let p n and p' n be the rational integers defined by 


Wn ~\~Vn + 2 Un 

E 


1 =1 


ae 

b l 


Pn 

fawn -\~v n + 2 u n 


and 


E 


1= 1 


fyWn+Un 


Observe that there exist integers f n and / w ' such that 


Pn — ^u; w +?; n +2M A 2 “ 1 “ ttw n +v n -\-2u n — 1 b H h <3u)„+u n +w n + fnb Un (2.30) 


and 

P n — Ow„+u„ + a Wn +u n —ib + • • • + a Wn +\b " + f n b ”. (2.31) 

Since, by assumption, 


a w n +u n +Vn+j — a w n +j > f° r j — 1 . u »' 

it follows from (2.30) and (2.31) that p n — p' n is divisible by an integer multiple of 
b Un . Thus, for any prime number t dividing b. the f-adic distance between p„ and 
p' n is very small and we have 


\Pn-Pn\l<K n - 
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Furthermore, it is clear that 

\b Wn+Un $ - p' n \< 1 and \b Wn+Vn+2un % - p n \ < \. 

Consider now the four linearly independent linear forms with real algebraic coef¬ 
ficients 

L hoo (x 1 ,x 2 ,x 3 ,x 4 ) = x l , 

L 2 , 0O {X l ,X 2 ,X 3 ,X A ) = X 2 , 

L 3t00 (X 1 ,X 2 ,X 3 ,X 4 ) = $X l -X 3 , 

L 4 , 00 (X 1 ,X 2 ,X 3 ,X 4 ) = $X 2 -X 4 . 

Evaluating them on the integer points x„ := (/; u; «+ !J «+ 2u ", b Un+w „ , p n , p' n ), we get 
that 

n l^ 7 ,oo(x„)| < b 2wn+Vn+iUn . (2.32) 

l</<4 

For any prime number p dividing h, we consider the four linearly independent 
linear forms with integer coefficients 

L hl (X l ,X 2 ,X 3 ,X 4 ) = X lt 
L 2 , t {X 1 ,X 2 ,X 3 ,X 4 ) = X 2 , 

L 3i e(XuX 2 ,X 3 ,X 4 ) = X 3 , 

L 4 ,e(X 1 ,X 2 ,X 3 ,X 4 ) = X 4 -X 3 . 

We get that 

Y\ n \ L j,d x n)\e < b~ (2wn+Vn+3u,t) b- Un . (2.33) 

t\b 1 <j <4 

Since a satisfies Condition (4fr), we have 

liminf-—-> 0. 

n—>+oo W n + 2 U n + V n 

It then follows from (2.32) and (2.33) that there exists s > 0 such that 


n iL^i-n n 

l</<4 l\b 1<7<4 


< max{b Wn+Vn+2ltn , b Un+Wn , p n , p' n Y 


for every n > 1 . 

We then infer from Theorem 6.7 that all the points x„ lie in a finite number of 
proper subspaces of Q 4 . Thus, there exist a non-zero integer quadruple (z\ ,z 2 ,z 3 , z 4 ) 
and an infinite set of distinct positive integers M\ such that 


for any n in M\. 


Zlb w n +v n +2 u n + ^u n +w n + + z 4 p' n = Q, 


(2.34) 
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Dividing (2.34) by b Wn+Vn +2 “«, W e get 


z\ + z 2 b Un Vn + z 3 


Pn 


fow„+v n +2u n 


+ z 4 b 


Pn 


fou n +w n 


= 0. 


(2.35) 


Recall that u n tends to infinity with n. Thus, the sequences ( p n /b Wn+Vn+2u ") n >\ 
and (p' n /b Un+Wn ) n >i tend to f as n tends to infinity. Letting n tend to infinity along 
Mi, we infer from (2.35) that either £ is rational, or z\ — z 3 = 0 . In the latter case, 
we obtain that £ is rational. This is a contradiction, since the sequence {at )t> 1 is not 
ultimately periodic. Consequently, £ cannot be algebraic. □ 

Also, the Schmidt Subspace Theorem was applied similarly as in the first proof of 
Theorem 4.1 by Troi and Zannier [ 68 ] to establish the transcendence of the number 
Y2mes 2 -m , where S denotes the set of integers which can be represented as sums of 
distinct terms 2 ^ + 1 , where k > 1 . 

Moreover, in his short paper Some suggestions for further research published in 
1984, Mahler [53] suggested explicitly to apply the Schmidt Subspace Theorem ex¬ 
actly as in the first proof of Theorem 4.1 given in Section 9 to investigate whether the 
middle third Cantor set contains irrational algebraic elements or not. More precisely, 
he wrote: 


A possible approach to this question consists in the study of the non- 
homogeneous linear expressions 

\3 Pr+Pr X -3 Pr X - N r \. 

It may be that a p-adic form of Schmidt’s theorem on the rational ap¬ 
proximations of algebraic numbers [ 10 ] holds for such expressions. 

The reference [10] above is Schmidt’s book [64]. 

We end this section by mentioning an application of Theorem 4.1 . Adamczewski 
and Rampersad [12] proved that the binary expansion of an algebraic number contains 
infinitely many occurrences of 7/3-powers. They also established that the ternary 
expansion of an 

algebraic number contains infinitely many occurrences of squares, or infinitely 
many occurrences of one of the blocks 010 or 02120 . 


10 Proof of Theorem 4.2 

We reproduce the proof given in [29]. 

Throughout, the constants implied in -C depend only on a. Assume that the 
sequences (U n ) n > t, ( V n )«> i, and (W n ) n >i occurring in the definition of Condition 
(♦) are fixed. For n > 1, set u n — \U n \, v n — \V n \, and w n — \W n \. We assume 
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that the real number a := [0: a \, a 2 ....] is algebraic of degree at least three. Set 
P -1 = <7o = 1 and q-i = p 0 = 0. 

We observe that a admits infinitely many good quadratic approximants obtained 
by truncating its continued fraction expansion and completing by periodicity. Pre- 

(ft) 

cisely, for every positive integer n, we define the sequence (b k )k> i by 

b { k ] = a h for 1 < h < w„ + u n + v„, 

b wl+h+j(u n +v n ) = a ™n+h for 1 < h < u n + v n and j > 0. 

The sequence (b^)k >t is ultimately periodic, with preperiod W„ and with period 
U n V n . Set 

and note that, since the first w„ + 2 u n + v„ partial quotients of a and of a„ are the 
same, it follows from Corollary 5.2 that 

W-Un\ < q^l+lun+vn- ( 2 ' 36 ) 

Furthermore, Lemma 5.8 asserts that a„ is root of the quadratic polynomial 

Pn(X) {q Wn —iq Wn -\-u n +v n ~ QwnQw n +Un+v n — 1 )-^ ~ 

~ (qwn-lPwn+Un + Vn ~ Q_w n Pw n + u n +v n -1 
+ Pw n — l<}w n +u n +v n ~~ Pw n <lw n +U n +V n — l)2f 
"f~ ( P Wn — 1 P Wn Pwn Pwn-\-Uii~hVfi — l)* 

By (2.10), we have 

I {<lw n — l<lw n +u n +v„~<lw n <lw, l +u n +v n —l)Oi — (q_w n -l Pw n +u n +v n — <lw n Pw n +U n +V n -l)\ 

— blwn — 1 +u H + rr Pw/-i-\-Un~\~Vn I "f~ tfwn\^iwn — 1^ Pwn^rUjy^rVn — ll 

— 2 Qw n +u n + v n 

(2.37) 

and, likewise, 

— l^U)«+M« + Un — QwnQwn+Un+Vn-l)® ~ ( P w n — 1 <lw n +u n + v n ~ Pw n Qw n +u n +v n — 1 )l 

— Qwn+Un+Vn Ww n -l® ~ Pw n — \ I + Q_w n +u n +v„- 11 Q_w n ^ ~ P w n I 

— 2 Qw n Q'Wn +“n +V n ■ 

(2.38) 

Using (2.36), (2.37), and (2.38), we then get 
\Pn(0i)\ = | P n (a) ~ Pn(0C n )\ 

— K^Wn-lQwn+Un+Vn ~ Qw n <iw n +u n +v n ~ 1) ( a ~ a n ) ( a + a n ) 

~ ( Qw n -l P w n +u n +v n — <hv n Pw n +U n +V n -1 
+ Pwn-lQwn+Un + Vn ~ Pw n Qw n +u„ +v„ — 1) ( a — a n ) I 
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I (<hl>n — lQwn +U/7 ~\~Vn Qwn + Wn +Un — l)^ 

— (^Wn — lPwn+Un +v n ~ Quin Pu> n +u n +v n — 0 
5“ (tfwn — lQwn+Un+Vn ~ Quin Qw n +u n +v n — l) 0 * 

~ (Pwn — llwn+Un+Vn ~ PwnQwn+Un+Vn — l) 

5“ (flwn — l ( lwn+Un+'»n ~ Q-W n Q_w n +u n + Vn — 1 )( a n ~ °0 I ' l a — a n I 

<$C |a — a„| • (qw n q Wl \ +Un + Vn + q w lqw n +u n +v n + qw n qw„+u n +v„ W — a «l) 

|a — q', ! |< 7 u;k q Wn +u n +v„ 

7ttJ„ qw n +u n +v n q-w n +2u n +v n ■ (2.39) 

We consider the four linearly independent linear forms 

L 1 (X 1 ,X 2 , X 3 , X 4 ) = a 2 Jfj - a(X 2 + X 3 ) + X 4 , 
L 2 (X 1 ,X 2 ,X 3 ,X 4 ) = aX 1 -X 2 , 

L 3 (X u X 2 ,X 3 ,X 4 ) = aX 1 -X 3 , 

L 4 (X 1 ,X 2 ,X 3 ,X 4 ) = X v 


Evaluating them on the 4-tuple 

x n '- — (qw n -iqw n +u n +v n — qw n qw n +u n +v n -l- 
qw n -\Pw n +u n +v n — qw n Pw n +u n +v n -i, 
Pw n — iqw n +u n + Vn — Pw n quin+Un + v n — U 
Pw n — lPw n +u n +v n ~ Pw„ Pw„+u n +v n —l)> 


it follows from (2.37), (2.38), (2.39), and Theorem 5.3 that 

1 [ l^f( x n)| < ^- q v) n -\-Urt~\~ v n 7 U'n - 2 u )1 — Vn 

i<y<4 

<<c 2~ Un 

ss (n n , , \-Su„/(2w n +U n +V n ) 

\rl U)n ^dy^n ~\~Un ~\~Vn ) ’ 


if n is sufficiently large, where we have set 
M = 1 + limsup 

£—s-+oo 

Since a satisfies Condition (4fr), we have 


and 


log 2 
log M 


u„ 

lim inf--- > 0. 

H^+oo 2 w n + u n + v n 

Consequently, there exists s > 0 such that 

]~[ I Lj (x„) | <5C ( q Wn q Wn +u n +v n ) 
t<y<4 


holds for any sufficiently large integer n. 
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It then follows from Theorem 6.7 that the points x„ lie in a finite union of proper 
linear subspaces of Q 4 . Thus, there exist a non-zero integer quadruple (x\,x 2 , x 2 , X 4 ) 
and an infinite set of distinct positive integers such that 


— l Q_w n +u n + v n qwnQw n +Un + v n — 1) 

+X 2 (q W n — 1 Pu) n + U n +V n tfw n Pw n +U n +V n —\) 

“1“ %3 ( Pw n — 1*7 W n -\-Un~\-Vn Pwn^}wn~\~Un~\~Vn — l) 

~F-t"4 (Pwn — 1 Pwn +U/7 + u« Pwn Pwn~hUn^-Vn — l) ^' 


(2.40) 


for any n in M \. 

• First case: we assume that there exist an integer i and infinitely many integers 
n in M\ with w n = t. 

By extracting an infinite subset of J\f\ if necessary and by considering the real 
number [0: . cit+ 2 , • ■ ■] instead of a, we may without loss of generality assume 

that w„ — i — 0 for any n in J\f\. 

Then, recalling that q-\ = p 0 = 0 and r/o = p~\ — 1, we deduce from (2.40) 
that 

xiqu n +v n —i T - Y 2 Pu n -\-v n —\ x 2 q Un + Vn ~ X4Pu n +v n — 0, (2.41) 

for any n in A/). Observe that (x\ , x 2 ) ^ (0,0), since, otherwise, by letting n tend to 
infinity along J\f\ in (2.41), we would get that the real number a is rational. Dividing 
(2.41) by q Un+Vn ,we obtain 


qun -\~Vji — 1 Pun~\-Vn — 1 qUn~\-Vn — 1 Plln~\~Vn 

X\ - X 2 - *-X 3 — X 4 - 

qu n +v n qu n +v n -i qu n +v„ qu n +v n 

By letting n tend to infinity along J\f\ in (2.42), we get that 


(2.42) 


: = lim 

Afisn—>+oo 


qu n +v n -i 

q U n +v n 


X 3 + X\0i 
X\ + X 2 Ci 


Furthermore, observe that, for any sufficiently large integer n in A/i, we have 


q Q.u n -\-Vn — l 


X 3 + X 4 O' 

x 3 x 4Pun + V,,/Qun+Vn 

P 

qu n +v n 


X\ + x 2 a 

x l + x 2 p Un + Vn -i/q Un +v n -i 


« -, 

q^n^Vn — \qun 

by (2.10). Since the rational number q Un +v n -i/qu„+ Vn is in its reduced form and 
u„ + v n tends to infinity when n tends to infinity along M\, we see that, for every 
positive real number r\ and every positive integer A, there exists a reduced rational 
number a/b such that b > N and |/3 — a/b\ < q/b. This implies that /3 is irrational. 
Consider now the three linearly independent linear forms 


L' 1 (Y 1 ,Y 2 ,Y 3 ) =PY x -Y 2 , L' 2 (Y u Y 2 ,Y 3 ) =aY l -Y 3 , L’ 3 (Y U Y 2 ,Y 3 ) = Y 2 . 
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Evaluating them on the triple (q u „+v„ , q Un +v»-i, Pu n +v n ) with n e A/"i, we infer 
from (2.10) and (2.43) that 


I”[ I Lj(q u „+v n 

1</<3 


<lun+v n - 


l' Pu n +v n ) I < S£ q 


-l 

u n +v n ■ 


It then follows from Theorem 6.7 that the points (?«„+«„, ?»„+«„ -i, Pu„+v„ ) with 
n e A/"i lie in a finite union of proper linear subspaces of Q 3 . Thus, there exist 
a non-zero integer triple (vi, y 2 , J 3 ) and an infinite set of distinct positive integers 
Mi C Mi such that 


y\q u „+v„ + y2qu n +v„-i + ysPu n +v„ = o, (2.44) 

for any n in Mi. Dividing (2.44) by q u „+v n and letting n tend to infinity along M 2 , 
we get 

y 1 + y 2 ft> + y 3 a = 0. (2.45) 

To obtain another equation relating a and /3, we consider the three linearly inde¬ 
pendent linear forms 

L"(Z 1 ,Z 2 ,Z 3 ) = PZ!-Z 2 , L"(Z!,Z 2 ,Z 3 ) = aZ 2 -Z 3 , 

L"(Z 1 ,Z 2 ,Z 3 ) = Z 2 . 


Evaluating them on the triple (9ii„+u„,?M„+u„-i,p« n +«„-i) with n in M, we infer 
from (2.10) and (2.43) that 


\ I I L'j(q Un +v n 

l<y<3 


qu n +v n 


-1, Pu„+v n -i)\ <£ q 


-1 

* 


It then follows from Theorem 6.7 that the points (q u „+v n ■ qu„+v„-i, Pu„+v„-i) with 
n e Mi lie in a finite union of proper linear subspaces of Q 3 . Thus, there exist a non¬ 
zero integer triple (z 1 , z 2 , z 3 ) and an infinite set of distinct positive integers A/3 C A/2 
such that 

+ Z2qu„+v„-l Z^Pun+Vn-l = 0, (2.46) 

for any /z in A/3. Dividing ( 2 . 46 ) by q Un+Vn -\ and letting n tend to infinity along A/3, 
we get 

^ + z 2 + z 3 a = 0. (2.47) 

P 

We infer from (2.45) and (2.47) that 

(z 3 a + z 2 )(y 3 a + yq) = y 2 zi. 

Since ft is irrational, we get from (2.45) and (2.47) that v 3 z 3 7 ^ 0. This shows that cr 
is an algebraic number of degree at most two, which contradicts our assumption that 
a is algebraic of degree at least three. 
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• Second case: extracting an infinite subset A /4 of A/i if necessary, we assume 
that (w n ) ne jq 4 tends to infinity. 

In particular (p Wn /q Wn )„ e aT 4 and (Pw„+u„+v„/qw„+u„+v„)neX 4 both tend to a 
as n tends to infinity. 

We make the following observation. Let a be a letter and U, V, W be three finite 
words (F may be empty) such that a begins with WUVU and a is the last letter of 
W and of UV. Then, writing W = W'a, V — V'a if F is non-empty, and U = U'a 
if F is empty, we see that a begins with W' (aU)V' (aU) if V is non-empty and with 
W'(aU')(aU') if V is empty. Consequently, by iterating this remark if necessary, we 
can assume that for any n in A/ 4 , the last letter of the word U„ V„ differs from the last 
letter of the word W n . Said differently, we have a Wn A dw n +u n + v n for any n in A/ 4 . 

Divide (2.40) by q Wn q Wn+Un+Vn - 1 and write 



We then get 



(2.48) 


for any n in A/ 4 . To shorten the notation, for any l > 1, we put R( ■— ff — pt/qt and 
rewrite (2.48) as 


x l (Qn 1 ) + x 2 {Qni® Rw n +u n +v n ) Rw n +u n +v n - 1 )) 

TY3 (Q« (01 R_ Wn — 1) (a R Wn )) 


+ X4 (Qn( 0 ( Rw„-l)((X Rwn+Un+Vn) ( a Rw n +u n +v n -\)) — 0. 

This yields 


(Qn ~ l)(xi + (x 2 + x 3 )a + x 4 a 2 ) = x 2 Q n Rw n +u n +v n ~ x 2 R w „+ Un +v„-\ 



(2.49) 


Observe that 




(2.50) 


by ( 2 . 10 ). 
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We use (2.49), (2.50) and the assumption that a Wn a Wn+Un+Vn for any n in A /4 
to establish the following claim. 


Claim. We have 


X] + (X 2 + x 3 )a + X 4« 2 = 0. 


Proof of the Claim. If there are arbitrarily large integers n in A /4 such that Q„ > 2 
°r Q n < 1 /2, then the claim follows from (2.49) and (2.50). 

Assume that 1/2 < Q n <2 holds for every large n in A/ 4 . We then derive from 
(2.49) and (2.50) that 

I (Qn ~ 1)0*1 + (X2 + x 3 )a + x 4 Q! 2 )| « | Pw n — \ I « qwl-rfwl- 

If x'i + 0*2 + x 3 )a + X 4 or f 0, then we get 

l0»-l|«^-t<C (2.51) 

On the other hand, observe that, by Theorem 5.4, the rational number Q n is the quo¬ 
tient of the two continued fractions [a Wn+Un+Vn ;a Wn+Un+Vn -i,... ,a x ] and 
Wwn'^wn- 1 . • • Since a Wn+Un+Vn # a Wn , we have either a Wn+Un+Vn -a Wn > 
1 or a Wn — a Wn+Un+Vn > 1. In the former case, we see that 


Qn > 


^Wn +M/j -\~Vn 


Qw„ + 


1 


> 


a w n + 1 


1 + 


1 


Clw n + 


a w„- 2 + 1 

+ 2 


> 1 + 


1 


( d w „ + l)(flu ;„-2 + 2 ) 


O-Wn-2 + 1 

In the latter case, we have 
1 

1 a u>n+~ - —7 

1 > _ + 1 > 1 + - 


1 


> 1 + - 


1 


Qn a w n +u n + + 1 ( fl ii),i-l + ^){Pw n +u n +v„ + 1) 0*iu„ —1 + l)flu; n 

Consequently, in any case, 

I Qn - 1| » a~l min a'^} » a~\ q~l_ v 
Combined with (2.51), this gives 


®w n ^ Qw n Qw n — li 

which implies that « is bounded, a contradiction. This proves the Claim. □ 

Since a is irrational and not quadratic, we deduce from the Claim that xi = X 4 = 
0 and x 2 = — x 3 . Then, x 2 is non-zero and, by (2.40), we have, for any n in A/ 4 , 


t}w n -l Pw n +U n +V n Qw n Pw, t +u n +v n -l — Pw n -iq_w n +u n +v n Pw n qw n +u n +v n -\- 
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Thus, the polynomial P n (X) can be simply expressed as 

Pn(X) (q Wn -iq Wn +u n + v n ~ qw n qw n +Un+v n — l)X 

— —iqw n — l Pw n +u n +v n — qw n Pw n +u„+v n -l)X 
+ (Pwn — lPwn+Un + Vn ~ Pw n Pw n +u n +v n — l)- 

Consider now the three linearly independent linear forms 

L'"(T U T 2 , T 3 ) = ol 2 T x - 2clT 2 + T 3 , 

L 2 (Ti, T 2 , T 3 ) = aTi - T 2 , 

L 3 (Ti, T 2 , T 3 ) = Ti. 

Evaluating them on the triple 

X n ■ (quin — 1 q 11 - n ' U H ■' Vr/ qwn^lwn ~\~ 1 ln + ~ 1 ’ 

qw n — \Pw n +u n + v n ~ qw n Pw n +u n + v n -l, 

Pw n — 1 Pw„+u n + v n ~ Pw„ Pw n +u n + v n -l), 
for n in A/ 4 , it follows from (2.37) and (2.39) that 

FI I Lj ( x ,;)l "4C qw„ qw„+u n + v n qJfi+lUn+Vn ^ ^ w n qw„+u n +v n ) S , 
l</<3 

with the same e as above, if n is sufficiently large. 

We then deduce from Theorem 6.7 that the points x ' n , n e A/ 4 , lie in a finite union 
of proper linear subspaces of Q 3 . Thus, there exist a non-zero integer triple (t\ , t 2 , t 3 ) 
and an infinite set of distinct positive integers A /5 included in A /4 such that 

h(q w n —iqw n +u n +v n qw n qw n +u n +v n —i) 

~\~^2 {qw n — lPw n +u n +v n — qw„ Pw„+u n +v„-l) (2.52) 

"T f 3 (Pwn —1 Pwti+Un +~ Pw n Pw n +u n +v n — l) = 
for any n in A/5. 

We proceed exactly as above. Divide (2.52) by q Wn q Wn +u n +v n -i and set 
Qn - = (qwn-iqu>n+Un+Vn)/(qw n qw n +Un+Vn — l)■ 


We then get 


h(Qn ~ 1) + hyQn 

+ hi Qn 


>n+Un+Vn -1 \ 
'n “Mi/i —1 / 


Pw n +u n +v n _ Pw n 
qwn+Un+Vn Qw 
Pw n — 1 Pvin+Un+Vn 

quin — 1 qnjfi -\-Un ~\~Vn qV)n 4 Wn “fU/i +— 


Pwn Pwn+Un+Vn — 


t) = 


(2.53) 


for any n in A/'s. We argue as after (2.48). Since p w Jq Wn and 
tend to a as 77 tends to infinity along A/ 5 , we derive from (2.53) that 

?i + t 2 a + t 3 a 2 — 0, 


a contradiction since a is irrational and not quadratic. Consequently, a must be tran¬ 
scendental. This concludes the proof of the theorem. □ 
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11 A transcendence criterion for quasi-palindromic 

continued fractions 

In this section, we present another combinatorial transcendence criterion for contin¬ 
ued fractions which was established in [29], based on ideas from [5]. 

For a finite word W : = Wi... Wk, we denote by W Wk ■.. W\ its mirror 
image. The finite word W is called a palindrome if W = W. 

Let a = (<2 £)£>i be a sequence of elements from A. We say that a satisfies 
Condition (4k) if a is not 

ultimately periodic and if there exist three sequences of finite words (U n )„>i, 
(V n ) n > t, and (W n ) n >i such that: 

(i) For every n > 1, the word W„ U n V n U „ is a prefix of the word a. 

(ii) The sequence (| V n \/\U n |)„>i is bounded from above. 

(iii) The sequence ([ W„\/\U n \) n >i is bounded from above. 

(iv) The sequence (|£/„|) n >i is increasing. 

Theorem 11.1. Let a — be a sequence of positive integers. Let ( pi/qi)t>\ 

denote the sequence of convergents to the real number 

a [0: fli, <32. ai,...]. 

Assume that the sequence (q f )i> | is bounded. If a satisfies Condition (4k), then ct 
is transcendental. 

A slight modification of the proof of Theorem 11.1 allows us to remove the as¬ 
sumption on the growth of the sequence i, provided that a stronger condition 

than Condition (4k) is satisfied. 

Theorem 11.2. Let a = (ar)^> i be a sequence of positive integers and set 

a := [0; ai, < 32 ,..., ar,.. ■]■ 

Assume that a — is not eventually periodic. If there are arbitrarily large 

integers l such that the word a ] . ■ - Uf is a palindrome, then oi is transcendental. 

We leave to the reader the proof of Theorem 11.2, established in [5], and establish 
Theorem 11.1. 

Proof. Throughout, the constants implied in -C are absolute. 

Assume that the sequences ( U„) n >i, and (W„) n >i are fixed. Set w„ = 

\W n \, Un — \U n \ and v„ — \V n \, for n > 1. Assume that the real number a : = 
[0; a\, « 2 , • ■ •] is algebraic of degree at least three. 

For n > 1, consider the rational number P„/ Q n defined by 
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and denote by P„/ Q' n the last convergent to P n / Q n which is different from P n / Qn- 
Since the first w n + 2i<„ + v n partial quotients of a and P n / Qn coincide, we deduce 
from Corollary 5.2 that 

\QnOC — P n \ < Qn(l w ^+2un+v n ’ I Qn a ~ I < Qn^w^+lun +v n ' (2-54) 

Furthermore, it follows from Theorem 5.4 that the first w„ + 2 u n + v„ 
partial quotients of a and of Q' n / Q n coincide, thus 

\Q n a - Q' n \ < Qnq~ 2 n+Un , (2.55) 

again by Corollary 5.2, and, by Theorem 5.6, 


Qn < ^QwnQwn +2u n +v n < 2q Wn + u „ Qw„ +2u n +v n ■ 

Since 

OiiQnOi ~ Pn) ~ (! Q' n a ~ P') = OiQ n (« ~ ~ Qn (« ~ 

it follows from (2.54), (2.55), and (2.56) that 

W 2 Qn -OLQ' n -aP„ + P' n \ « Q n q^ +Un q^ n + 2lln +v n + Qn' 

« Qn'- 


(2.56) 



(2.57) 


Consider the four linearly independent linear forms with algebraic coefficients 

L l (X 1 ,X 2 ,X 3 ,X 4 ) = a 2 X x -aX 2 -aX 3 + X A , 

L 2 (X ], X 2 , X 3 , X 4 ) = aX 2 - X A , 

L 3 (X U X 2 ,X 3 ,X 4 ) =aX 1 -X 2 , 

L 4 (X 1 ,X 2 ,X 3 ,X 4 ) = X 2 . 


We deduce from (2.54), (2.55), (2.56), and (2.57) that 


]~[ I Lj(Qn, Q„, Pn, P n )\ ^ Ql <lwl+2u„+v n <lwl+u„ 

I <7 <4 


«: q 


w„ 


q 


-2 


By combining Theorems 5.3 and 5.6 with (2.56), we have 


Quin qw n +u n 2 


« Qn 


—8u„/{2w n +2u n +v n ) 


if n is sufficiently large, where we have set 


M = 1 + limsup 

£—>•+00 


and 


log 2 
log M 
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Since a satisfies Condition (4), we have 

u„ 

liminf--- > 0. 

ft—>-+00 ^Wyi H - H - ^ft 

Consequently, there exists s > 0 such that 

11 \ L j(Qn,Q'n’Pn,P')\«Q7> 

1<7<4 


for every sufficiently large n. 

It then follows from Theorem 6.7 that the points ( Q n , Q' n , P n , P' n ) lie in a fi¬ 
nite number of proper subspaces of Q 4 . Thus, there exist a non-zero integer 4-tuple 
(x\, x 2 , X 3 , X 4 ) and an infinite set of distinct positive integers such that 


X 1 Qn + X 2 Q' n + *3 P n + X4 P„ — 0 , 

for any n in N \. Dividing by Q n , we obtain 

Q' P n P' n Q' n _ 

Xl 2 eT 4 a'eT" 

By letting n tend to infinity along Ad, we infer from (2.55) and (2.56) that 

x\ + ( x 2 + Xi)a + x 4 a 2 = 0. 


(2.58) 


Since (xi, X 2 , x 3 , x 4 ) 7 ^ (0,0,0,0) and since a is irrational and not quadratic, we 
have xi = x 4 = 0 and X 2 = —X 3 . Then, (2.58) implies that 

Q'n = Pn- 

for every n in M\ . Thus, for 11 in , we have 

\u 2 Q n -2aQ' n +P' n \ « Q~\ (2.59) 

Consider now the three linearly independent linear forms 

L\{X x , X 2 , X 3 ) = c/ 2 X\ - 201 X 2 + X 3 , 

L' 2 (X 1 ,X 2 ,X 3 ) = aX 2 -X 3 , 

L' 3 (X i,X 2 ,X 3 ) = X 1 . 

Evaluating them on the triple ( Q n , Q' n , P' n ) for n in M \, it follows from (2.54), (2.56) 
and (2.59) that 


11 Wj{Qn,Q' n , 

l</<3 




Ai < hv n < iw n +2u n + v n ^ tfu) n *lw n +u n ^ Qn 


-s/2 


with the same s as above, if n is sufficiently large. 
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It then follows from Theorem 6.7 that the points ( Q n , Q' n - Pn ) he in a finite num¬ 
ber of proper subspaces of Q 3 . Thus, there exist a non-zero integer triple ( v' i , >' 2 - >’ 3 ) 
and an infinite set of distinct positive integers A4 such that 

y\Qn + yiQ'n + y^P'n — 0 ’ (2.60) 

for any 11 in J\f 2 . 

Dividing (2.60) by Q„, we get 

P n P'n Pn „ 

Ji + J2— + V3— • — = 0 . (2.61) 

\ln \l n 

By letting n tend to infinity along M 2 , it thus follows from (2.61) that 

>'1 + >'2« + T3O! 2 = 0. 

Since (y\, y 2 , >'3) is a non-zero triple of integers, we have reached a contradiction. 
Consequently, the real number a is transcendental. This completes the proof of the 
theorem. □ 


12 Complements 


We collect in this section several results which complement Theorems 3.1 and 3.2. 
The common tool for their proofs (which we omit or just sketch) is the Quantitative 
Subspace Theorem, that is, a theorem which provides an explicit upper bound for the 
number T of exceptional subspaces in Theorem 6.7. 

We have mentioned at the beginning of Section 3 that the sequence of partial 
quotients of an algebraic irrational number 6 cannot grow too rapidly. More precisely, 
it can be derived from Roth’s Theorem 6.5 that 


log log q n 
lim - 

n —»-+oc Tl 


= 0 , 


(2.62) 


where (pi/qi)i> 1 denotes the sequence of convergents to 9. This is left as an exer¬ 
cise. The use of a quantitative form of Theorem 6.5 allowed Davenport and Roth [38] 
to improve (2.62). Their result was subsequently strengthen [4, 25] as follows. 


Theorem 12.1. Let 9 be an irrational, real algebraic number and let (p n /qn)n> 1 
denote the sequence of its convergents. Then, for any e > 0, there exists a constant c, 
depending only on 9 and e, such that 

log log q n < cn 2/3+s . 


Proof. We briefly sketch the proof of a slightly weaker result. Let d be the degree of 
9. By Theorem 6.4, there exists an integer n 0 such that 
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for n > «(). Combined with Theorem 5.1, this gives 

q n + 1 < qi, for n >n 0 . (2.63) 


On the other hand, a quantitative form of Theorem 6.5 (see e.g. [40]) asserts that 
there exists a positive number r]o < 1/5, depending only on 9, such that for every i] 
with 0 < q < rjo, the inequality 



1 

q 2+r > ’ 


has at most q 4 rational solutions p/q with p and q co prime and q > . 

Consequently, for every n > 8/ q. with at most q~ 4 exceptions, we have 


q n +1 < q 


1 +ri 
n 


(2.64) 


Let N > (8 /q) 2 be a large integer and set h := [ y/N ~\. We deduce from (2.63) and 
(2.64) that 

log qN _ log qN ^ log qN—i ^ logg^+t 
log qh log qN— t log qpj —2 log q h 

< (1 + /d r4 , 

whence 

log log qN log log q h < N q + q~ 4 (\og d). 

Observe that it follows from (2.62) that 

log log q h < N 1/2 , 

when N is sufficiently large. Choosing q = A^ - ^ 5 , we obtain the upper bound 


log log qN < log log qh + N 4/5 (log3d) < 2)V 4/5 (log3<i), 


when N is large enough. A slight refinement yields the theorem. 

We first observe that, if we assume a slightly stronger condition than 

. p(n,w,A) 

limint - < Too 

n —>-+oo 11 


□ 


in Lemma 8.1, namely, that 

p(n,w. A) 

ltmsup - < Too, 

n —»+oo W 

then the word w satisfies a much stronger condition than Condition (4k). Indeed, there 
then exists an integer C >2 such that 


p(n,y/,A ) < Cn, for n > 1, 
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instead of the weaker assumption (2.22). In the case of the first proof of Theorem 4.1 
given in Section 9, this means that, keeping its notation, one may assume that, up to 
extracting subsequences, there exists an integer c such that 

2(m„ + v„ + w„) < w„+i + v n+ i + w n+ i < c(u„ + v„ + w„), n > 1. 

This observation is crucial for the proofs of Theorems 12.2 to 12.4 below. 

Now, we mention a few applications to the complexity of algebraic numbers, be¬ 
ginning with a result from [31]. Recall that the complexity function n i-> p(n. 6 , b) 
has been defined in Section 2. 


Theorem 12.2. Let b > 2 be an integer and 6 an algebraic irrational number. Then, 
for any real number tj such that rj < 1/11 , we have 


lim sup 

tt—>- + 0O 


p(n , 9, b ) 
n (log n) r > 


The main tools for the proof of Theorem 12.2 are a suitable extension of the 
Cugiani-Mahler Theorem and a suitable version of the Quantitative Subspace Theo¬ 
rem, which allows us to get an exponent of log/; independent of the base b. Using 
the recent results of [41] allows us to show that Theorem 12.2 holds for tj in a slightly 
larger interval than [0,1/11). 

As briefly mentioned in Section 6, one of the main features of the theorems of 
Roth and Schmidt is that they are ineffective, in the sense that we cannot produce an 
explicit upper bound for the denominators of the solutions to (2.19) or for the height 
of the subspaces containing the solutions to (2.20). Consequently, Theorems 3.1 and 
3.2 are ineffective, as are the weaker results from [14, 43]. It is shown in [24] that, by 
means of the Quantitative Subspace Theorem, it is possible to derive an explicit form 
of a much weaker statement. 


Theorem 12.3. Let b > 2 be an integer. Let 9 be a real algebraic irrational number 
of degree d and height at most H, with H > e e . Set 

M = exp{10 190 (log(8t/)) 2 (log log(8r/)) 2 } + 2 22Xo ^ U0Xo ^ H)) . 


Then we have 


p{n , 6, b ) > 



n, 


forn > 1. 


Unfortunately, the present methods do not seem to be powerful enough to get an 
effective version of Theorem 3.1. 

We have shown that, if the b- ary or the continued fraction expansion of a real num¬ 
ber is not ultimately periodic and has small complexity, then this number cannot be 
algebraic, that is, the distance between this number and the set of algebraic numbers 
is strictly positive. A natural question then arises: is it possible to get transcendence 
measures for £, that is, to bound from below the distance between / and any algebraic 
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number? A positive answer was given in [6], where the authors described a general 
method to obtain transcendence measures by means of the Quantitative Subspace 
Theorem. In the next statement, proved in [9], we say that an infinite word w written 
on an alphabet A is of sublinear complexity if there exists a constant C such that the 
complexity function of w satisfies 

p{n , w, A) < Cn, for all n > 1. 

Recall that a Liouville number is an irrational real number y such that for every 
real number w, there exists a rational number p/q with \y — p/q\ < 1 /q w . 

Theorem 12.4. Let f be an irrational real number and b > 2 be an integer. If the 
b-ary expansion of f is of sublinear complexity, then, either £ is a Liouville number, 
or there exists a positive number C such that 

\H-0\>H(0)-W C1 ° gl ° sOd \ 

for every real algebraic number 9 of degree d. 

In Theorem 12.4, the quantity H(9 ) is the height of 0 as introduced in Defini¬ 
tion 6.3. 

The analogue of Theorem 12.4 for continued fraction expansions has been estab¬ 
lished in [7, 28]. The analogues of Theorems 12.2 and 12.3 have not been written yet, 
but there is little doubt that they hold and can be proved by combining the ideas of 
the proofs of Theorems 3.2, 12.2 and 12.3. 


13 Further notions of complexity 

For an integer b > 2, an irrational real number f whose b- ary expansion is given by 
(2.1), and a positive integer n, set 

NZ{n,%,b) :=#{£: 1 <i< n,at 0}, 

which counts the number of non-zero digits among the first n digits of the b- ary 
expansion of £. 

Alternatively, if 1 < n i < n 2 < ■•■ denotes the increasing sequence of the indices 
l such that a 1 f 0, then for every positive integer n we have 

J\fZ{n, £, b) := max {j :nj<n}. 

Let s > 0 be a real number and 9 be an algebraic, irrational number. It follows 
from Ridout’s Theorem 6.6 that nj + \ < (1 + s)n j holds for every sufficiently large 
j. Consequently, we get that 

MZ(n, 9, b ) 

ltm --- = +00. 

«^+oo log n 
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For the base b = 2, this was considerably improved by Bailey, Borwein, Crandall, 
and Pomerance [17] (see also Rivoal [60]), using elementary considerations and ideas 
from additive number theory. A minor modification of their method allows us to get 
a similar result for expansions to an arbitrary integer base. The following statement 
is extracted from [27] (see also [11]). 

Theorem 13.1. Let b > 2 be an integer. For any irrational real algebraic number 9 
of degree d and height H and for any integer n exceeding (20 b d d 3 H) d , we have 

MZ(n,9,b) > —'— ( H ^, 

b — 1 \2{d + 1 )ad ) 

where a d denotes the leading coefficient of the minimal polynomial of 6 over the 
integers. 

The idea behind the proof of Theorem 13.1 is quite simple and was inspired by 
a paper by Knight [48]. If an irrational real number £ has few non-zero digits, then its 
integer powers f 2 . £ 3 ,..., and any finite linear combination of them, cannot have too 
many non-zero digits. In particular, f cannot be a root of an integer polynomial of 
small degree. This is, in general, not at all so simple, since we have to take much care 
of the carries. However, for some particular families of algebraic numbers, including 
roots of positive integers, a quite simple proof of Theorem 13.1 can be given. Here, 
we follow [60] and (this allows some minor simplification) we treat only the case 
b = 2. 

For a non-negative integer x, let B(x) denote the number of l’s in the (finite) 
binary representation of x. 

Theorem 13.2. Let 9 be a positive real algebraic number of degree d > 2. Let 
a d X d + • • • + a\X + «o denote its minimal polynomial and assume that a i,..., a d 
are all non-negative. Then, there exists a constant c, depending only on 9, such that 

MZ(n,9,2) > B(a d )- l/d n 1/d -c, 


forn > 1. 

Proof. Observe first that, for all positive integers x and y, we have 

B(x + y) < B(x) + B(y) 

and 

B(xy) < B(x) B(y). 

For simplicity, let us write J\fZ(n , •) instead of MZ(n , 2). Let £ and rj be posi¬ 
tive irrational numbers (the assumption of positivity is crucial) and n be a sufficiently 
large integer. We state without proof several elementary assertions. If £ + rj is in a¬ 
tional, then we have 

MZ{n,% + rj) < NZ(n,%) + J\fZ(n , rj) + 1. 
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If £ ry is irrational, then we have 

A rZ(n,%rj) < AfZ(n,%) ■ AfZ(n, rj) + log 2 (£ + rj + 1) + 1, 

where log 2 denotes the logarithm in base 2. If m is an integer, then we have 

J\fZ(n,m%) < B(m)(AfZ(n, £) + 1). 

Furthermore, for every positive integer A, we have 

AfZ(n,$)-JSfZ(n,A/$) >n-l- log 2 (£ + A/$ + 1)). (2.65) 

Let 0 be as in the statement of the theorem. The real number \ao\/9 is irrational, 
as are the numbers ajO J '~ l for j = 2,... ,d provided that aj ^ 0. Since 

\ao\e Cl\ @29 d d 9 d 

and NZ(n , 9) tends to infinity with n, the various inequalities listed above imply that 

A fZ(n, |ao|0 _1 ) < d + B{a\) + J\fZ(n,ci 20 ) + • • • + J\fZ(n,a d 9 d ~ 1 ) 

< d + B(a\) + B(ci 2 ){NZ{n, 0) + 1) + • • • 

+ B(a d )(AfZ(n,9 d ~ 1 ) + 1) 

< B{a d )UZ{n , 9) d ~ l + c x NZ(n, 9) d ~ 2 , 

where c\, like c 2 , Cj, c 4 below, is a suitable positive real number depending only on 
9. By (2.65), we get 

n 


( 2 . 66 ) 


J\fZ(n,\ao\9 )> 


c 2 . 


A fZ(n,9) 

Combining this with (2.66), we obtain 

B{a d )UZ{n,9) d + c 3 AfZ(n , 9) d ~ l > n 
and we finally deduce that 

A fZ(n,9) > B(a d )~ 1 /d n l/d - c 4 , 

as asserted. 


□ 


We may also ask for a finer measure of complexity than simply counting the num¬ 
ber of non-zero digits and consider the number of digit changes. 

For an integer b > 2, an irrational real number f whose 6-ary expansion is given 
by (2.1), and a positive integer n, we set 

AfBT>C(n,%,b) :=#{£: 1 < i <n,ai ^ ai +1 }, 

which counts the number of digits followed by a different digit, among the first n 
digits in the b -ary expansion of A The functions n i-> J\fBT>C ( n , £, b) have been 
introduced in [23]. Using this notion for measuring the complexity of a real number, 
Theorem 13.3 below, proved in [23, 31], shows that algebraic irrational numbers are 
‘not too simple’. 
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Theorem 13.3. Let b > 2 be an integer. For every irrational, real algebraic number 
9, there exist an effectively computable constant )io(0 , b), depending only on 9 and 
b, and an effectively computable constant c, depending only on the degree of 9, such 
that 

J\fBT>C(n , 0,6) > c (log/?) 3 / 2 (log log n) -1 / 2 (2.67) 

for every integer n > n q (0, b). 

A weaker result than (2.67), namely that 

J\fBT>C(n,9,b) 

lim -= +oo, (2.68) 

n^+oo log 77 

follows quite easily from Ridout’s Theorem 6.6. The proof of Theorem 13.3 depends 
on a quantitative version of Ridout’s Theorem. We point out that the lower bound in 
(2.67) does not depend on b. 

Further results on the number of non-zero digits and the number of digit changes 
in the 6-ary expansion of algebraic numbers have been obtained by Kaneko [45,46]. 
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1 Introduction 

In this short course, we aim to highlight connections between a certain class of ma¬ 
trices and Dirichlet series, in particular the Riemann zeta function. The matrices we 
study are of the form 


/ /(l) 

/(§) 


f{\) ••• \ 


m 

/a) 

/(f) 

/(f) ••• 


m 

/(§) 

/(l) 

/(f) ••• 

(*) 

m 

/( 2) 

/(f) 

/(l) ••• 



V 


/ 
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i.e., with entries a t j = f(i/j ) for some function / : Q + — > C. They are a multiplica¬ 
tive version of Toeplitz matrices which have entries of the form a, j = < 2 ;-, . For this 
reason we call them Multiplicative Toeplitz Matrices. 

Toeplitz matrices (and operators) have been studied in great detail by many au¬ 
thors. They are most naturally studied by associating with them a function (or ‘sym¬ 
bol’) whose Fourier coefficients make up the matrix. With ajj = a t -j , this ‘symbol’ 
is 

OO 

a(t ) = ^ a„t n . t e T 

n=— oo 

Then properties of the matrix (or rather the operator induced by the matrix) imply 
properties of the symbol and vice versa. For example, the boundedness of the opera¬ 
tor is essentially related to the boundedness of the symbol, while invertibility of the 
operator is closely related to a(t) not vanishing on the unit circle. 

For matrices of the form (*) we associate, by analogy, the (formal) series 

I] mq lt , 

?eQ+ 

where q ranges over the positive rationals. Note, in particular, that if / is supported 
on the natural numbers, this becomes the Dirichlet series 

£/(«)«''■ 

ne N 


In the special case where /(/;) = n ~ a , the symbol becomes £(a — it). It is quite 
natural then to ask to what extent properties of these Multiplicative Toeplitz Matrices 
are related to properties of the associated symbol. Rather surprisingly perhaps, these 
type of matrices do not appear to have been studied much at all - at least not in this 
respect. Finite truncations of them have appeared on occasions, notably Redheffer’s 
matrix [29], the determinant of which is related to the Riemann Hypothesis. Denoting 
by A„(f) the n x n matrix with entries f ( i/j) if j |i and zero otherwise, it is easy to 
see that 


A n {f)A n (g) = A n (f * g ), where f * g is Dirichlet convolution, 
since the i j lh entry on the left product is 


E 


An(f)t,A n (g)ri = E/(;My) = E 


J\r\i 


d\i/j 


if j\i by putting r — jd, and zero otherwise. With 1 and /i denoting the constant 
1 and the Mobius functions, respectively, it follows that /l„(I) A„(p) = I n — the 
identity matrix. Note also that det A n (1) = det A n (ji) = 1. Redheffer’s matrix is 


/ 

0 

i 

1 • 

1 

\ 


0 

0 

0 • 

• 0 


V 

0 

0 

0 • 

• 0 

J 


Rn — A n ( 1) + 


= A n (l) + E n 
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say, where the matrix E n has only Is on the topmost row from the 2nd column on¬ 
wards. Then, with M(n) — Ylr=i ri( r )> 



/ 

M(n) - 1 

* 

* 

\ 


/ M(n) 

* 

* 

\ 



0 

0 • 

• 0 



0 

1 

• 0 


hi + 






- 







0 

0 • 

• 0 

) 


V 0 

0 • 

1 

/ 


so that det R n — M(n). The well-known connection between the Riemann Hypothe¬ 
sis (RH) and M(n) therefore implies that RH holds if and only if det R n = 0(n 2 +£ ) 
for every e > 0. (See also [20] for estimates of the largest eigenvalue of R n ). 

Briefly then, the course is designed as follows: In Section 2, we recall some 
basic aspects of the theory of Toeplitz operators, in particular their boundedness and 
invertibility. In Section 3, we study bounded multiplicative Toeplitz operators. This 
is partly based on some of Toeplitz’s own work [34], [35] and recent results from [17] 
and [18], but we also present new results, mainly in Section 3. Thus Theorem 3.1 is 
new, generalising Theorem 2.1 of [18], which in turn is now contained in Corollary 
3.2. Also Subsection 3.2 and parts of 3.4 are new. 


Preliminaries and Notation 


(a) The sequence spaces l p (1 < p < oo) consist of sequences ( a n ) for which 
Yi^= i \ a n\ p converges. They are Banach spaces with the norm 

/ 00 \ i Ip 

\\{an)\\ p = • 

\j=l ' 

The space /°° is the space of all bounded sequences, equipped with the norm 
||(a„)||oo = sup„ €N I a n\- We shall also use l p (Q + ), which is the space of se¬ 
quences a q where q ranges over the positive rationals such that \a q \ p < oo, 
with analogous norms and also for p — oo. 

I 2 and / 2 (Q + ) are Hilbert spaces with the inner products 

OO 

(a,b) = Ya n b n and (a,b) = a q b q , 

n =1 ?eQ+ 


respectively. 

(b) Let T — {z e C : \z\ — 1} - the unit circle. We denote by L 2 ( T) the space of 
square-integrable functions on T. L 2 (T) is a Hilbert space with the inner product 
and corresponding norm given by 


(fg) = 



i r 2n 

2rr J 0 


f(e lti )g(e id ) d6, 
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The space L°°(T) consists of the essentially bounded functions on T with norm 
ll/lloo denoting the essential supremum of /. (Strictly speaking, L 2 and L°° 
consist of equivalence classes of functions satisfying the appropriate conditions, 
with two functions belonging to the same class if they differ on a set of measure 
zero.) 

Let Xn(t) = t n for n £ L. Then (Xn)nsz is an orthonormal basis in L 2 ( T) 
and L 2 (T) is isometrically isomorphic to l 2 CL) via the mapping / i-> (fn)nei^ 
where f n are the Fourier coefficients of /, i.e.. 



(c) A linear operator <p on a Banach space X is bounded if \\(px || < C||x|| for all 
x e X. In this case the operator norm of <p is dehned to be 

II03XII 

ll<»ll = su p -n-r = su p ll^ll- 

xeX.x^O IN II || jc || = 1 

The algebra of bounded linear operators on X is denoted by B(X). 

(d) An infinite matrix A = (ay ) induces a bounded operator on a Hilbert space H if 
there exists <p e B(H) such that 


ttij — 

where (c,-) is an orthonormal basis of H. Note that not every infinite matrix 
induces a bounded operator, and it may be difficult to tell when it does. 

(e) For the later sections we require the usual O, o, ~, notation. Given /, g de¬ 
fined on neighbourhoods of oo with g eventually positive, we write /(x) = 
o(g(x)) (or simply / = o(g)) to mean lim^oo f(x)/g(x) = 0, f(x) = 
0(g(x)) to mean |/(x)| < Ag(x) for some constant A and all x sufficiently 
large, and /(x) ~ g(x) to mean lim.v^oo f(x)/g(x) = 1. 

The notation / g means the same as / = 0{g), while / < g means 
f(x) < (1 +o(l)k(x). 


2 Toeplitz matrices and operators - a brief overview 

Toeplitz matrices are matrices of the form 


/ a o 

a -1 

a~ 2 

a- 3 

ai 

«o 

a -1 

a~2 

fl2 

a i 

flo 

O-l 

a 3 

«2 

a i 

Oo 


V 



(3.1) 
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i.e., T — ( tij ), where t,j — cii-j. They are characterised by being constant on 
diagonals. 

For a Toeplitz matrix, the question of boundedness of T was solved by Toeplitz. 

Theorem 2.1 (Toeplitz [34]). The matrix T induces a bounded operator on l 2 if and 
only if there exists a e L°°(T) whose Fourier coefficients are a n (n e Z). If this is 
the case, then ||7j| = ||«||oo- 


We refer to the function a as the ‘symbol’ of the matrix T, and we write T(a). 
Sketch of Proof For a e L 2 (T), the multiplication operator 
M(ay. L 2 ( T) —► L 2 ( T), fw^af 


is bounded if and only if a e L°°(T). If bounded, then | M (a) || = ||fl||oo- The matrix 
representation of M(a) with respect to (Xn)ne z is given by 

(M(a)xj, Xi) — i a Xj’Xi) — / a Xj~Xi — / aj— = ai-j, 

J t Jt 


i.e., by the so-called Laurent matrix 


L(a) : = 


••• a 0 

a -1 

a -2 

a -3 

a- 4 ■ ■ ■ 

ai 

a 0 

a -1 

12—2 

a-3 ■■■ 

• •• a 2 

a\ 


a -1 

a~ 2 ••• 

• •• a-i 

a 2 

ai 

ao 

a -1 ••• 


(3.2) 


The matrix for T is just the lower right quarter of L(a ). We can therefore think of T 
as the compression PL(a)P, where P is the projection of / 2 (Z) onto l 2 — / 2 (N). 
An easy argument shows that T is bounded if and only if a e L°°, and then \\T\\ = 
\\L(a)\\ = ||a||oo. □ 


Hardy space. Let H 2 { T) denote the subspace of L 2 ( T) of functions / whose 
Fourier coefficients f n vanish for n < 0. Let P be the orthogonal projection of 
L 2 onto H 2 , i.e., P(J2„ez fnXn) = J2„>ofnXn- The operator / P(af) has 
matrix representation (3.1). For, with j > 0 (so that Xj e H 2 ( T)), 


(T(a)Xj^Xi) 


= / P{axj)Xi = / P(J2 an X»+j = J2 an ~j / 

Jj K neZ > n> 0 Jj 


Xn—i — &i—j 


if i > 0, and zero otherwise. Hence, we can equivalently view T ( a ) as the operator 
T(a): H\ T) H 2 { T), / ^ P(af). 
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2.1 C(T), W(T), and winding number Let C(T) denote the space of con¬ 
tinuous functions on T. For a e C(T) such that a(t ) ^ 0 for all t e T, we de¬ 
note by wind (a, 0) the winding number of a with respect to zero. More generally, 
wind(a. A) =wind(a — A,0) denotes thewinding number with respect A £ C. For 
example, wind(/„, 0) = n. 

The Wiener Algebra is the set of absolutely convergent Fourier series: 

I oo oo \ 

X a n*n : X < °°[ ■ 

Some properties: 

(i) W(T) forms a Banach algebra under pointwise multiplication, with norm 

OO 

IMI w ■= X l a «l- 

—oo 

(ii) (Wiener’s Theorem) If a e W and a(t) 0 for all t e T, then a~ l e W. 

(iii) If a e W(T) has no zeros and wind(a, 0) = 0, then a = e b for some b e W(T). 
We have 

W(T) c C(T) C L°°(T) C L 2 { T). 

2.2 Invertibility and Fredholmness Let A be a bounded operator on a Banach 
space X. 

(i) A is invertible if there exists a bounded operator B on X such that AB = BA = 
I. As such, B is the unique inverse of A, and we write B = A -1 . The spectrum 
of A is the set 

a (A) = {A e C : XI — A is not invertible in X}. 

The kernel and image of A are defined by 

Ker A — {x e X : Ax — 0}, ImA — {Ax : x e X}. 

(ii) The operator A is Fredholm if ImA is a closed subspace of X and both Ker A 
and X /ImA are finite-dimensional. As such, the index of A is defined to be 

Ind A = dim Ker A - dim (A/Ini A). 

For example, T(/ n ) is Fredholm with Ind T(x n ) = —n. 

Equivalently, A is Fredholm if it is invertible modulo compact operators; that 
is, there exists bounded operator B on X such that AB — I and BA — I are both 
compact. 

The essential spectrum of A is the set 

Oess(A) = {A e C : XI — A is not Fredholm in X}. 

Clearly a eS s(A) C a(A). Note that A invertible implies A is Fredholm of index 
zero. For Toeplitz operators, the converse actually holds (see [5], p. 12). 
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2.3 Hankel matrices These are matrices of the form 


/ a i 

a 2 

a-i 

04 

... V 

a 2 

a 3 

a 4 

05 



a 4 

a 5 

fl 6 


a 4 

a 5 

a 6 

07 


V ! 




/ 


i.e., H — ( hjj ), where h (/ — o, + . They are characterised by being constant on 

cross diagonals. The boundedness of H was solved by Nehari, and the compactness 
of H by Hartman. 

Theorem 2.2 ([28], [14]). The matrix H generates a bounded operator on l 2 if and 
only if there exists b E L°°(T) (with Fourier coefficients b n ) such that b n = a n for 
n > 1 . Furthermore, the operator FI is compact if and only ifb E C(T). 


We refer to the function a as the ‘symbol’ of the matrix H, and we write H{a). 
Given a function a defined on T, let a be the function 

a{t) = a(\/1) (t e T). 


Proposition 2.3. Fora.b E L°°(T), 

T(ab) = T(a)T(b) + H(a)H(b) 
H(ab) = H(a)T(b) + T(a)H(b). 


Proof The matrix L(a ) in (3.2) is of the form 


L(a) = 


( T(a) H(a) \ 
V H(a) T(a) ) 


Since L(ab) = L(a)L(b), the result follows by multiplying the 2 x 2 matrices. □ 

As a special case, we see that Tfabc) — T(a)T(b)T(c) for a E H°°,b E 
L°°,c E H°°. The space H°° is defined analogously to L°°. ( T(a ) is upper- 
triangular and T (c) is lower-triangular.) 

By Theorem 2.2, if a,b E C(T), then H(a)H(b ) is compact, so that T(ab) — 
T{a)T(b ) is compact. In particular, if a has no zeros on T, we can take b = a~ l E 
C(T). Then T{ab) = T'(l) = /, so T(a) is invertible modulo compact operators 
(i.e., Fredholm) with ‘inverse’ T{a~ l ). This type of reasoning leads to: 


Theorem 2.4 (Gohberg [9]). Let a e C(T). Then T(a) is Fredholm if and only if a 
has no zeros on T, in which case 


Ind T(a) = — wind(a.O). 
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Hence T(a) is invertible if and only if a has no zeros on T and wind (a, 0) = 0. 
Equivalently, since T(X — a) = XI — T(a)forX e C, we have 

<J ess (T{a)) = a(T), 

o(T(a)) = a(T) U {A e C \ a(T) : wind(n, A) 0}. 

Sketch of Proof. We have seen that a f 0 on T implies T{a) is Fredholm. In this 
case, let wind (a. 0) = k. Then a is homotopic to /*, and (since the index varies 
continuously) 

Ind T(a) — Ind T(xk ) = — k — —wind (a, 0). 

For the converse, suppose T(a) is Fredholm with index k, but a has zeros on T. 
Then a can be slightly perturbed to produce two functions b, c e W( T) without zeros 
such that ||a — 6||oo and ||n — c'Hoo are as small as we please, but wind(fr,0) and 
wind(c, 0) differ by one. As the index is stable under small perturbations, T(b) and 
T(c) are Fredholm with equal index. But Ind T(b ) = —wind(Z>, 0) and Ind 7 (c) = 
—wind(c, 0) (by above), so wind(/). 0) — wind(c, 0) = 0 — a contradiction. □ 

2.4 Wiener-Hopf factorization Since W{T) c C(T), Theorem 2.4 applies 
to 1T(T). However, for Wiener symbols we can obtain a quite explicit form for the 
inverse when it exists. This is because Wiener functions can be factorized. 

Denote by W+ and W~ the subspaces of W consisting of functions 

oo oo 

^2 a nt n an d I eT 

n—0 n =0 

respectively, where Yl\ a n\ < oo. 

Theorem 2.5 (Wiener-Hopf factorization). Let a e W( T) such that a has no zeros, 
and let wind (a. 0) = k. Then there exist a~ e W~ and a+ e W+ such that 


a = Xk a - a +- 

Proof We have windin'/-*;, 0) = 0. So ax~k = e b for some b e W. But b — 
b- + b+, where b- e W~ and b+ e W+. Hence, writing n_ = e b ~ and a+ — e b+ 
gives 

ax~k = e b ~e b+ = a-a + . □ 

Theorem 2.6 (Krein [23]). Let a e W( T). Then l'(a ) is Fredholm if and only if a 
has no zeros on T, in which case 

Ind T(a) = — wind(a,0). 

In particular, T(a) is invertible if and only if a has no zeros on T and wind(n, 0) = 0. 
In this case 

T(a )- 1 = T{af l )T{aZ l ), 

where a = n+n_ is the Wiener-Hopf factorization of a. 
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Proof of second part. Note that if a e W-, then H(a ) = 0, while if a e W+. then 
H(d) = 0. Suppose a has no zeros on T and wind(a, 0) = 0. Then a factorizes as 
a = a-a + with a± e W±. Applying Proposition 2.3 with a_ and a+ in turn gives 

T(aZ l )T(a-) = T(aZ l aJ) = / = T(a-aZ l ) = T{aJ)T{aZ l ), 
T(af 1 )T(a+) = T(af 1 a+) = I = T(a+af l ) = T(a + )T(af_ l ), 

so that T(a±) are invertible with T(a ±)~ 1 = T{aZ^). But also T(a) = T(a-a+) = 
T(a-)T(a+) (by Proposition 2.3). Hence T{a)~ l = T(a+)~ l T(a ~)~ 1 = 

T(af 1 )T(aZ 1 ). □ 


3 Bounded multiplicative Toeplitz matrices and operators 

3.1 Criterion for boundedness on / 2 Now we consider the linear operators 
induced by matrices of the form (*), regarding them as operators on sequence spaces, 
in particular l 2 . 

For a function /: Q + -^Con the positive rationals, we define 

f(q) = lim Y f ( —Y whenever this limit exists. 

N^-oo z —' V n ' 

fleO+ m.n^N 

1 ^ On ,«) = 1 

We shall sometimes abbreviate the left-hand sum by f(q). We say that / e 

iH Q + )if 

I] \m\ 

?sQ+ 

converges. In this case, the function 

Fit) = 1 e R 

?eQ+ 

exists and is uniformly continuous on R. Note that for A > 0, 
lim [ F{t)X~ lt dt = 

T —>oo 11 J—T 

Theorem 3.1. Let f e / 1 (Q + ) and let <p f denote the mapping (a n ) i-> ( h n ) where 

OO 

bn = lL 

m= 1 


| /(?), if A = q e Q+, 

) 0, otherwise. 
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Then q>f is bounded on l 2 with operator norm 


\Wf\\ = sup 


E f ( ^ lt 


qe(. 


= IIFI 


Proof. We shall first prove that <pf is bounded on l 2 , showing ||<pf || < | /' Hoc in the 
process, and then show that H^Hoo is also a lower bound. 

For q e Q + and TV e N, let 


N oo 




m — l 


m—1 


Note that bq N ^ —> h q as N —>■ oo for every q e Q + , whenever a n is bounded. We 
have the following formulae: 


lim jtf f F{t) E a n n lt dt = J2a n bi N) 
T^oc IT J_ T ^ ^ 

f_ T mY, a * nit dt =H K N) \ 2 - 

n —1 / 7 eO+ 


(3.5) 

(3.6) 


(These hold for a n bounded.) To prove these expand the integrand in a Dirichlet 
series. For the first formula we have 

1 ~t N 2 i „t n _j t 

— F(t)^2ci n n lt dt = E F ^(~) dt 

J ~ T n= 1 m,n<N J ~ T 


N 


n= 1 


—► E a « fl «/(^) = J2 a ” b n N) 

m,n<N 

as T —> oo. For the second formula, note first that 
N 

F(t)^2 a „n lt = E f{q)a n (qn) lt 


n= 1 


qeQ+ ,n<N 


reQf n<N 

the series converging absolutely. Thus 
•ri N 1 2 

2 T 


= E(E/ 0 “-V" = E ev". 


f r F (, 2 *= ^ (^"a^ei'Ti 

^ l <?i,92£Q + T 2 ?eQ + 


as T —> oo. 
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Since |F(f)| < Hi 7 Hoc, we have 

j TV 2 TV 

E l*ri'-ll^ll-r 1 ^^/ ^=ll^l| 2 ooEl«n| 2 - 

<jreQ+ “ T n = 1 n = l 

Thus if a = (a n ) e l 2 , we have \bi N) \ 2 < ||T'||^ 0 ||o|| 2 for every N. Letting 

N —> oo shows that (7>„) e / 2 too (indeed ( b q ) e ( 2 (Q + )), and so <pf is bounded on 
l 2 , with 

I Wf\\ < Halloo. 

Now we need a lower bound. By Cauchy-Schwarz, 


oo |2 oo 


Y clnhn <EM 2 -El^ 


72=1 77 = 1 


Thus ||^y|| > | @nbn | for every a = ( a „) e / 2 with ||a|| = 1. Choose r/„ as 

follows: let (V e N (to be determined later) and put 


a„ = — , for n I TV, and zero otherwise. 

JW) 

Here d(N) is the number of divisors of N. Thus (a n ) e / 2 and Hull = 1. With this 
choice, 

b n = -^Yf(-)m- u (= bi N >) 

Km) 


and so 


^ E fQO" 


n\N m\N 


where 


?eQ+ 

S q (N)= E L 


Put q = j, where (A:, /) = 1. Then ^ = j if and only if /« = Arm. Since (7:, /) = 1, 
this forces k\n and / \m. So, for a contribution to the sum, we need k, l\N, i.e., kl\N. 
Suppose therefore that kl\N. Then 

Sq(N) = Y ^ = E ' m = r ^' n = r k r e ^ 

m,n\N rk,rl\N 

In — km 


= E *-0 
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Writing \q\ = kl whenever <7 = j in its lowest terms, gives 


00 

^ ' &nbn 

n—\ 


E 

<7 €Q+ 
1*11# 


f(q)q 


i, d(N/\q \) 
d(N) 


(3.7) 


The idea is now to choose N in such a way that it has all ‘small’ divisors while 
' s c l° se t0 1 f° r all such small divisors \q\. Take N of the form 


N = JT p ap , where a p 
p<p 


log P ] 
logp J" 


Thus every natural number up to P is a divisor of N. Every q such that |g[|./V is of 
the form \q\ — ] — [ / 7 < J p pPp (0 <P P < a p \ so that 


d(N/\q\) _ I r / _ Pp \ 
d(N) ~a p + l> 


If we take \q\ < y/log P , then p^ p < ylog P for every prime divisor p of \q\. 
Hence, for such p, ft p < ^ and = 0 if p > ylog P . Thus 


d(N/\q |) 
d(N) 


= n_( 

P<\/logP 


1 - 


-( 


1 - 


Up + 

log log P X Tr(VlogP) 
2 log P / 


i)* n ( 

p<s/ iog 7 


1 - 


log log P \ 
2 log P ) 


where tc(x) is the number of primes up to x. Since n(x) = 0( j^), it follows that 
for all P sufficiently large, the RHS above is at least 


A 

y/^gP 


for some constant A. 

2 

Let s > 0. Then there exists n o such that X^|>h 0 [/(<?)| < e - Choose P > e n o 

so that v /log P > li q. Then the modulus of the sum in (3.7) can be made as close to 
|| F ||oo as we please by increasing P, for it is at least 


E 

\q\<*/\ogP 


> 


E f^)q lt 

A' 


qeQ-<- 

> \F(t)\- 


y/to.gP 
A 


T 




-2s, 


E E i/(?)i 

kl<\/iog p l?l>Vi°g^ 

E \m\- 2 E 1 /( 9 )i 


qe{ 


\q\>J\ogP 
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where s can be made as small as we please by making P large. Since this holds for 
any t, we can choose t to make F(t) as close as we like to ||.F|| (X) . Hence \\(pf\\ > 
|| F || oo and so we must have equality. □ 

In the special case where / > 0, the supremum of \F(t)\ is attained when t — 0, 
in which case ||F||oo = F(0) = ||/|Ii,q+- Thus: 

Corollary 3.2. Let f : Q + —> C such that f > 0. Then <p f is bounded on l 2 if and 
only if f e l 1 (Q + ), in which case \\tpf\\ = ||/||i,q+. 

Example. Take fin) = n~ a for/; e N, and zero otherwise. Then F(t) = f(a —it). 
We shall denote cp/ by (p a in this case. Applying Corollary 3.2, we see that (p a is 
bounded on / 2 if and only if a > 1, and the norm is f (a). 

3.2 Viewing </>/ as an operator on function spaces; Besicovitch space 

We can view <pf as an operator on functions rather than sequences. For this we need 
to construct the appropriate spaces. 

Let A denote the set of trigonometric polynomials; i.e., the elements of A are all 
finite sums of the form 

^a k e lXkt , 

k= 1 

where a k e C and e K. The space A has an inner product and a norm given by 


(f,g) = lim ^ f fg and ||/|| = J(f f) = J lim f |/| 2 . 
T^oo Z1 J-j y T^oo 11 j 

Now let B 2 (Besicovitch space) denote the closure of A with respect to this inner 
product; i.e., / e B 2 if \\f — f n || —> 0 as n —> oo for some /„ e A. We turn B 2 into 
a Hilbert space by identifying / and g whenever ||/ — g|| =0. (See [3], Chapter II.) 
Now write xx(t) = 2" (A > 0, / eR) and let /(A) denote the Fourier coefficient 

1 r T _ l f T 

/(A) = lim — / f xx = lim — / f(t)X lt dt where it exists. 

7 —>oc z 1 J—T T^-oo z 1 J—j 1 

Denote by T the space of locally integrable /: R —» C such that /(A) exists for all 
A > 0. 

(a) Fourier coefficients and series For f e B 2 , the Fourier coefficients f (A) ex¬ 
ist and f (A) is non-zero on at most a countable set, say {A„ The function 
f has the (formal) Fourier series Xw ; >i / (^n)^n- 

(b) Uniqueness f g e B 2 have the same Fourier series if and only if \\f—g\\ = 0. 
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(c) Parseval For f e B 2 , 



and, more generally, for fg e B 2 , 

ifg) = lim f fg = J2 

T^oa 11 J_ T “ 

(d) Riesz-Fischer Theorem Given A„ > 0 and a n e l 2 , there exists f e B 2 
such that f(t ) ~ J2n> l a nKl ■ 

(e) Criterion for membership in B 2 : With J~ as before, if f e T and Parseval’s 
identity (3.8) holds, then f e B 2 . 

Indeed, the set of A for which / (A) f 0 is necessarily countable and we 
may write this as {A x , A 2 ,...} with [/( X k )\ 2 = \\f\\ 2 . Let f n (t) = 

2 ~lk<n /(Afc)Ajt. Then 

11/ - fn \\ 2 = ll/ll 2 - \\fn \\ 2 = Y l/WI 2 ^ 0 as n -> oo. 

k>n 


The analogues of the Hardy and Wiener spaces: B 2 + , B 2 , lVq+, W 

(a) Let B 2 i+ denote the subspace of B 2 of functions with exponents A = log q 

for some q e Q + . Alternatively, start with the subset of A consisting of finite 
trigonometric polynomials of the form Yl a qXq^ where q ranges over a finite 
subset of Q + , and take its closure. 

(b) Let denote the subspace of B 2 of functions with exponents A = log n for 
some n e N. This is the analogue of the Hardy space. 

(c) Let !Fq+ denote the set of all absolutely convergent Fourier series in B 2 , ; i.e. 

W Q + = { Y c ^Xq : Y < °°}- 

<?eQ+ </eQ+ 

This is the analogue of the Wiener algebra. As we saw earlier, if 

f = Y C ^)A? e W Q +, 

qeQ+ 

then f (q) = c(q). With pointwise addition and multiplication, Wq+ becomes 
an algebra. Further, Wq+ becomes a Banach algebra with respect to the norm 

\\f\\w = Y 

?eQ+ 

Analogously, let ITidenote the set of absolutely convergent series x a n n 11 . 
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/i, 2 | has (Xq)q& q+ as an orthonormal basis while B 2 has (/„ )„ eN as an orthonor¬ 
mal basis. The spaces B 2 , and B 2 are isometrically isomorphic to / 2 (Q + ) and l 2 , 
respectively, via the mappings 

/ ^ {f(q)} q e Q+ and / {/(«)}„>!• 

The operator M(f). Let P be the projection from B 2 , to B 2 ; that is, 

E c(q)q a ) = J2 c ( n)nlt - 

? gQ+ «eN 

For / € Wq+, we define the operator M(f): B 2 B 2 by g i-^ P(fg). 

The matrix representation of M(f) w.r.t. {/„ }„ e u is the multiplicative Toeplitz 
matrix (f(i/j))ij> t- Indeed, if f = J2 q f(q)Xq ■ then 

P(fxj) = p{J2,f(q)xqXj) = Pfcmxqj) 

q q 

oo 

= '(£ f(q/j)Xq ) = ^2f(n/j)Xn- 

q n =1 


Hence 

oo 

( M(f)xj,xi) = (P(fxj)-Xi) = E/(”/;')<*„, *i> = /(-)■ 

n = l J 


In terms of the operator <p, 

M(/) = r “V/r. 


3.3 Interlude on £ (.V) Since this work concerns connections to Dirichlet series 
and the Riemann zeta function in particular, we recall a few relevant facts regarding 

m- 

The Riemann zeta function is defined for > I by the Dirichlet series 


OO - 

= E 


n = 1 


In this half-plane (s ) is holomorphic and there is an analytic continuation to the 
whole of C except for a simple pole at s = 1 with residue 1. Furthermore, £ (s) 
satisfies the functional equation 


S(.s) = xMQ-s) 
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where 


*0) = - s) sin(y) = n s ^ ^ ^ ■ 


2 > - F(§) 

The connection to prime numbers comes from Euler’s remarkable product formula 


m = n^ 

1 1 1 - E 


Sfls > 1 


The order of £(s). Considering £(ct + it) as a function of the real variable t for 
fixed (but arbitrary) ct, it is known that for \t\ large 

tcr(0 = t( CT + it) = 0(\t\ A ) for some A. 

The infimum of such A is the order of £ and is called the Lindelof function; i.e., 
/x(cr) = inf{H : £(cr + it) = 0(\t\ A )}. 


From the general theory of functions it is known that the Lindelof function is convex 
and decreasing. Since is bounded for a > 1, but -/-? 0, it follows that /x(ct) = 0 
for a > 1. By the functional equation and continuity of // we then have 




0, if ct > 1, 
5 — a, if ct < 0. 


(O) 


For 0 < ct < 1, the value of /i ( ct ) is not known, but it is conjectured that the two 
line segments in (O) above extend to ct = This is the Lindelof Hypothesis. It is 
equivalent to the statement that 

+ it) = 0(t E ) for every s > 0. 


The Lindelof Hypothesis is a major open problem and is a consequence of the Rie- 
mann Hypothesis, which states that t, (,v) f 0 for a > 


Upper and lower bounds for t, a . Let 

Z a {T) — max |^(cr + f/)|. 
t<|fi<r 

(The restriction \t\ > 1 is only added to avoid problems for the case a = 1.) The 
following results hold for large T. 

(a) Z a {T) ^ ^(ct) for ct > 1. 

2 

(b) For ct = 1, unconditionally it is known that Z\(T) = 0((logr)3), while on 
RH 


Z\(T) < 2e v log log T. 
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On the other hand, Granville and Soundararajan [13] proved that 

Zi(T ) >e K (loglog T + log log log T — log log log log T), 

for some arbitrarily large T. They further conjectured that it equals the above 
with an 0(1) term instead of the quadruple log-term. 

(c) For | < a < 1, unconditionally one has Z a (T) — 0(T a ) for various a > 0, 
while on RH 

, ry ^ ^ a (log T) 2 ~ 2a 

log Z a (T) < A- --—--- 

(1 - a ) log log T 

for some constant A. Montgomery [27] showed that 

A” 1/2 (log T) l ~° 


log Z n (T) > 


20 (log log T) a 


and, using a heuristic argument, he conjectured that this is (apart from the con¬ 
stant) the correct order of log Z a (T). In a recent paper (see [24]), Lamzouri 
suggests that 

(log r) 1_tT 

log Z a {T)~C{o) y * 


(log log T) c 


with C(a) = Gi(a) a a 2<J (1 — a) a 1 , where 


GAx) 


pOO / 0 ° 

= log E 

J ° \n=0 


2 n 


(m/2) 

(n ' )2 


du 


/ l + l/.r ' 


(d) For o = j unconditionally one has Zi(T) — 0(T^s (log T) c ) = 0(r° 156 -) 


(see [19]), while on RH 


log T 

logZ^(T) < A & 

2 log log T 


for some constant A. On the other hand, it is known that 

log Zi (T) > c 


log T 


log log T 

(see [2], [31]). Using a heuristic argument. Farmer et al. ([8]) conjectured that 


log Z\(T) 


1 


log T log log T. 


(e) For cr < the functional equation for C, (.v) reduces the problem to the case 
a > So, for example, 

Z a (T) ~ £(1 for ct < 0. 
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Mean values. For o e (|, 1), the mean-value formula 



is well-known (see [33]). Furthermore, 



For higher powers, however, present knowledge is very patchy. It is expected that the 
above formulas extend to all higher moments, i.e., 



This is equivalent to the Lindelof Hypothesis. 

Examples. 

(a) The mean values for and imply that . '(f e Z? 2 for a e (|, 1). Note that 
this also implies [£ CT | 2 e i? 2 + . 

For higher powers, however, only partial results are known. For example, it is 
known that Q e /i 2 if a e (1 — ^,1). Slightly better bounds are available, 
especially for particular values of k, but it is expected that much more holds, 
namely: e /i 2 for every k e N and all o G (|, 1). This is (equivalent to) the 

Lindelof Hypothesis. 

(b) Let g(s) = YlnL i an d h{s) = YlnLi t> e two Dirichlet series which con¬ 
verge absolutely for div > oo and v > r> \. respectively. Let a > oq and /) > 0 \ 
and put f{t) = g(a — it)h(P + it). Then / e Wq+ with 



for q = ™ with (m , n) = 1. 


We can prove this by multiplying out the series for g(a — it) and h{j3 + it). We 
have 



OO 


oo 



OO 


oo 



oo 
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Further Properties of Wq+ and JF N . Notation. For a unital Banach algebra A, 
denote by GA the group of invertible elements of A. and by GqA. the connected 
component of GA which contains the identity. If A is commutative, then 

a £ GqA <*==> a = e b for some b £ A. 


(a) Wiener’s Theorem for Hr ,4 and H/ (see [15], Theorems 1 and 2): 

Let f £ Hq+. Then 1// £ 1Tq+ if and only if infu \ f \ > 0; i.e., GW^+ = 
{/ £ IT q+ : inf R [/[ > 0}. 

Let f(t) = Y^n=\ a n nlt e Wm and put F(s) — Y^=i ^ f or — 0- Then 
1// £ Vbk if and only if there exists 8 > 0 such that |.F(j)| > 8 for all > 0. 

The example f(t) = 2“ shows that the condition \ f(t)\ > 8 > 0 for all t e R is 
not sufficient for 1 // £ IT 4 . 

(b) Let / £ GWq+ . Then the average winding number^ w(ff defined by 


w(f) = 


lim 

T —>-oo 


arg f(T) - arg f(—T) 
2 T 


exists, and w(f) = logg for some q £ Q + (see [21], Theorem 27). 

It is easy to see that (i) w(fg) = w(f) + w(g), and (ii) w(/ 9 ) — log q. 

(c) GoHn = GVFn; i.e. for / £ 1 L^, we have 1 // £ Wfi if and only if / = e g for 
some g £ Hn 


3.4 Factorization and invertibility of multiplicative Toeplitz o pera tors 

The analogue of the factorization T(abc) — T(a)T(b)T(c ) for a £ H°°,b £ 
L°°, c £ H°° for Toeplitz matrices holds for Multiplicative Toeplitz matrices. 

Let W \; denote the set of absolutely convergent series 1 a n n ~ lt ■ 

Theorem 3.3. Let f £ B/, g £ Bq+, and h £ LL],r. Then 

M(fgh) = M(f)M(g)M(h). 

Proof We show the matrix entries agree. By Proposition 3.1, fgh £ Wq+ and 
(M(f)M(g)M(hj) = M(f) lk M(g) kl M(h) li 

11 k,l> 1 

= E /(jMt)*(j) 

k,l> 1 J 

i\k and j\l 

- E 

m,n> 1 


1 Also known as mean motion. 
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On the other hand, since all Q + -series converge absolutely we have 


f(t)g(t)h(t) = X / (qi)g(q2)h(q3)(qiq 2 q3) lt 


< 1 \, 42,4 3 ^ 0 + 



it 


X fgWqW 1 - 


Hence 


M{fgh)ij = fgh(-) = X f(qi)g(~ I —)kq3) = (■ M(f)M{g)M(hj) , 
J n , 7^7 1^3 l J 


since \/q\ and q 3 must range over N. 


□ 


In view of Theorem 3.3, it is of interest to know when a given / e ILq+ factorises 
as f — gh with g e W® and h e . For then M(f) — M(g)M(h) and the 
invertibility of M(f) follows from knowing when M(g) and M(h) are invertible. 
Thus, if h is invertible in W-\. then 

M(h)M(h~ l ) = M(hh~ l ) = M( 1) = I = M( 1) = M{h~ l h) = M{h~ l )M{h), 

so that M(h)~ l = Similarly, if g is invertible in then M(g) _1 = 

M(g~ l ). It follows then that M(/) _1 = M(g)~ l . 

Let J-Wq+ denote the set of functions in IIq,+ which factorise as 


/ = f-lqh 


(3.9) 


where /_ e GIT N , /+ e GITn, and q e Q+. 

Note that with / as above, then 1// = X{\/q)f+ l > so 1 // £ J-W<q+. In 

particular, J : Wq+ C G H / q i + . Note that M(x q ) is invertible if and only if q — 1. 

Theorem 3.4. Let f e J-W,^\. Then M(f) is invertible if and only ifw(f) = 0. If 
this is the case, then M(/) _1 = M(/_ i 7 1 )M(/_J 1 ), with f± as in (3.9). 

Proof Write / = f_ Xq f+ as in (3.9). Then M(f) = M(f-)M( X _ q )M(f+). Now 
M(f-) and M(/+) are invertible, with inverses M{f_ 1 ) and M(/ + 1 ), respectively. 
Thus M(f) is invertible if and only if M( Xq ) is invertible. But this happens if and 
only if q = 1. 

Since w(f) = w(f-) + w( Xq ) + w(f+) = w{ Xq ) = log q, we see that M(f) 
is invertible if and only if w(f) — 0. 

Now suppose w(f) = 0. Then the above gives 
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Multiplicative coefficients and Euler products. 

Definition. 

(a) A function a: Q + —> C is multiplicative if a(\) = 1 and 

a(p°i ■■■P a k k ) = a(p“')--- a (p a k k ), 

for all distinct primes pi and all a, e Z. We say a is completely multiplicative if, 
in addition to the above, 

a(p k ) = a(p) k and a(p~ k ) = a(p ~ l ) k , 

for all primes p and k e N. 

(b) For a subset S of IF, let M.F denote the set of / e S for which / (•) is multi¬ 
plicative. 

(c) Let / e F and p prime. Suppose that I / (p k ) I converges. Then define 

f P = E-/ (p k )X p k- 

keZ 

Note that f p is periodic with period yyfj,- Define f p \ T —> C by 

f}(e i0 ) = f p {9/ log p) = Y, hp k )e kW for 0 <9 < 2n. 

keZ 

Further, denote by W^+ p the set of those / e Wq+ whose Q + -coefficients are 
supported on { p k : k e Z}. (Thus f p e by definition.) 

Note that, for fixed p, there is a one-to-one correspondence between Wq+ p and 
W(T) via the mapping f 

In [16], the Euler product formulas 

/ = E fti)X q = YUp and M(/) = f[ M(f p ), 

geQ+ P P 

were shown to hold whenever / e M. Wq+ . 

The analogue of the Wiener-Hopf factorization holds for M. Wq+ -functions with¬ 
out zeros. 

Theorem 3.5. Let f e M.Wq+ such that f has no zeros. Then f e FWq+. 

Proof. We have / = Y\ p fp, where f p e Wq+ p and each is non-zero. Hence 
f p e W(T) and has no zeros. Let k p = wind ( fp. 0). Note that k p = 0 for all 


98 


Titus Hilberdink 


sufficiently large p, since 


- 1| < E \f{p m )\ < E 1/^)1 ~► 0 as P oo. 

m/O l?l>/> 


Let q = ]""[ p kp (a finite product). 


By Section 2.1 (iii), we have 


for some g^ e W(T). Hence 
with g p e Wq+ p . Thus for P so large that k p = 0 for p > P, we have 


/■# = /. e gp 

Jp A p k P 


f P = X p k P e Sp , 


ru=^ ex p e ^ • 

p<P '/><P 

Now f p (t) —> 1 as p —> oo uniformly in ?, so can choose g p so that g p (f) —> 0 as 
p —» oo (uniformly in ?). Hence, for all sufficiently large p (and all ?), [ f p — I = 
|e^ - 1| < so that 

|^I<2|/ P -1|<2^[/V , )|. 

m^O 


Let g^"* = £>• Then {g(”)} is a Cauchy sequence in Wq+ : for n > m 

\g w - g (m) \< E \8p \< 2 Y Ei/V)i< 2 Ei/>)i — 0 

m<p<n m<p<n m^O r>m 

as in —> oo. Thus —> g e Wq+. But each g^ is of the form h n + k n with 
h n e JTn and k n e ITn (since g p e Wq+,/?)- Thus g = h + k with /r e VFn, k e ITn- 
It follows that / = x g e h e k , which is of the required form. □ 

Note that, as such, 

w(f) = Y w ^ k P p ) = Y k P w ^ P ) = E wind( 4 tt - 0)1 °g^ 

p p p 

where the sum is finite. 

Corollary 3.6. Let f e M. ITq+ such that f has no zeros and w(f ) = 0. Then 
M{ f ) is invertible. 


Example. M(^ a ) is invertible for a > 1 with M(£ a ) 1 = M (!/£«)• 
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4 Unbounded multiplicative Toeplitz operators and 

matrices 

The last section has, in various ways, been a relatively straightforward extension of 
the theory of bounded Toeplitz operators to the multiplicative setting. The theory of 
unbounded Toeplitz operators is rather less satisfactory and not easy to generalise. 
Our particular concern is in fact the operator M(^ a ) for a < 1, since we expect 
a connection with the Riemann zeta function. The hope is that if a satisfactory theory 
is developed for this case, it can be generalised to other symbols. 

In this section, we shall therefore concentrate on the particular case when / (n) = 
n ~ a , when the symbol is £(a — it). Throughout we write (p a (equivalently, M(^ a )) 
for i ipf. 

From Section 3 we see that for a < 1 ,<p a is unbounded. It is interesting to see to 
what extent properties of (p a are related to properties of i) a . The above theory is only 
valid for absolutely convergent Dirichlet series, when the symbols are bounded. But 
^(a — it) is unbounded for a < 1. 

How to measure unboundedness? We shall investigate two different measures. 
The first case can be considered as restricting the range, while in the second case we 
shall restrict the domain. 


4.1 First measure - the function 4>«(iV) With b n defined by a = (a n ) >4 
(b n ), i.e., b n = J2d\n d~ a a n/d , let 



Theorem 4.1. We have the following asymptotic formulae for large N: 

<&i(N) = e y log log V + 0(1) (a = 1) 



Sketch of proof (For the proof see [17]). We start with upper bounds. 
First we note that for any positive arithmetical function g(n), 



(3.10) 


This is because 



\J g(d)a n /d 



g(d)\a n /d \ 2 

d a 


) 
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by Cauchy-Schwarz. Writing G(n) = Yhd\n S(d) 1 d “, we have 


Eiw 2 ^ E G <">E^^ 

n<N n<N d\n 


< max G(n) > 
n<N ^ 

d<N 


g(d) 

d a 


E 

n<N/d 


Taking ||a|| 2 = 1 yields (3.10). The idea is to choose g appropriately, so that the 
RHS of (3.10) is as small as possible. 

For \ < a < 1, choose g ( n ) to be the following multiplicative function: for 
a prime power p k let 

. \ 1, if p k < M, 

g ( p ’ ~ j if p k > M. 

Here M = (2a — 1) log A' and /3 is a constant satisfying 1 — a < /3 < a. Note that 
g(p k ) < g(p) for every k e N and p prime. 

We estimate the expressions in (3.10) separately. First 


E 

n<N 


g (») 
n a 


< 


n(>+E$?)^n( 

p v k =i F 7 p v 


i + 



g(p) } 
p a -V 

(3.11) 


and, after some manipulations using the Prime Number Theorem, one finds for the 


case a < 1 

o y- £00 < 

^ n a ~ (1 — a)(a + — 1) log M 


(3.12) 


Now consider G(n), which is multiplicative because g(n) is. At the prime powers we 
have 


G <A> = E-^ = E i + 4 E 


r ts />“ W) 


r > 0 
p r <M 


M& p( a ~P) r 


r < k 
p r > M 


< 1 + 


+ 


p a — 1 M“(l — pP~ a ) 

(Here we require /l < a.) Note that this is independent of k. It follows that 

G(/7)<expjy^-1—- - —|. 

p a - 1 M a 4^ 1 - 


Pl« 


/>l« 


The right-hand side is maximised when n is as large as possible (i.e. A) and A is of 
the form A = 2.3 ... P. For such a choice, log A = 9(P) ~ P, so that (using the 
prime number theorem) 


log max G (n ) < 

n<N *—' p' 

P<P F 


Y—— + — T\ 

p a - 1 M a ' 


(log A) 


1—a 


p<p 


(1 — a) log log A 


+ 


log A 


M a log log A 
(3.13) 
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From (3.12) and (3.13) it follows that M should be of order log N for optimality. So 
taking M = A log N (with A > 0), (3.10), (3.12) and (3.13) then imply 


log <J> a (N) 


< 




l—a 


i 


1 —a 


2(1 - a) (a + /J - 1) 


- i yiogAQ 

2 ( 1 — 0 ') 2A“ / log log N 


for every € (1 — a, a) and A > 0. Since a+ ^_ l decreases with the optimal 
choice is to take fi arbitrarily close to a. Hence we require inf^>o h( A), where 


/i(A) = 


aX l ~ a 1 1 

(1 — a)(2a — 1) (1 — a) + A“ 


Since h\ A) = ^ +l ( ^—[ — 1), we see that the optimal choice is indeed A = 2a — 1. 
Substituting this value of A gives 


log Q a (N)< 


(1 + (2a- 1)““) (logA0 1_ “ 
2(l — o:) log log iV 


For a = 1, we use the same function g(n) as before (though with possibly differ¬ 
ent values of M and /)). From (3.11) it follows that 


E 

n<N 


g(») 

n 


^n(i) 

p<M v P 7 


n 

p>M 


i + 


\ 

pHp - 1 ))' 


ByMertens’Theorem,thefirstproductise y logM-|-(?(l),whileM^ p 1 /! = 

0(1/log M), and so 

- < (e v log M + 0(1)^ exp{0(l/ log M)} — e v log M + 0(1). (3.14) 

n<N 



For the G ( n ) term we have, as for the a < 1 case, 


Thus, with N 


G(p k ) < 


1 | 1 

1 - j + M(1 — pP~ l ) 


2.3... P, 



1-1 /P 
M(l — pP- 1 ) 


) 


= (e^logP + 0(l))(l + 

Taking M = log N and noting that P ~ log N, the right-hand side is e v log log N + 
0(1). Combining with (3.14) shows that 


3>i(A0 < e y \og\ogN + 0(1). 




















102 


Titus Hilberdink 


The case a = The function g as chosen for a e (|, 1] is not suitable for 
an upper bound as we would require | < ~! Instead we take g to be the 

multiplicative function following: for a prime power p k , let 


g(p k ) = minjl, ( 


M 


P k ( log p) 2 



Here M > 0 is independent of p and k and will be determined later. Thus g(p k ) — 1 
if and only if p k (log p) 2 < M. Note that g(p k ) < g(p) < 1 for all k > I and all 
primes p. Thus (3.11) holds with a — k and (using the prime number theorem) 


lo § X] 


g(n) 


< 


E 


n<N 


M 


Jp- i 


+ s/m J2 


1 


M 


s[M 


p log p log M 


(3.15) 


'(log MV 


'dog Mp 


(The first sum is of order s[~M /(log M) 2 and the main contribution comes from the 
second term.) 

Regarding G(n), this time we have 2 





E 1o 8- 


so that 


log G(n) < ^ 


1 1 \ ^ 

—== — t + y, kXo %p 

sfp- 1 s/M “ 


< 1 0|» + ^ 


1 


s[M 


P\n ” ‘ ’ p K \\n ' p\n 

The right-hand side above is maximal when n = N — 2.3 ... P , hence 


sfp- 1 


log max G(n ) < 
n<N 


log N ^ _J_ 
sTM l^psTP 


logN 2y / Iog ~N 
sfM log log N ' 


Combining with (3.15), (3.10) then gives 


logO i (AO < 
2 


\[~M log N yiog ~N 

2 log M 2 sfM log log A'' - 


The optimal choice for M is easily seen to be M = log N log log N , and this gives 
the upper bound in (iii). 

Now we proceed to give lower bounds. 

For a fixed n e N, let 


= —= if d | n, and zero otherwise. 

y/d(n) 


2 Here as usual, p k \\n means p k \n but p k + 1 \n. 
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Then ||a|| 2 = 1, while for d\n 


bd 


1 1 _ G-a(d) 

~7W)' 


Hence for N > n. 


X>i 2 >I>3 = 

k<N d\n 


1 

d(n) 


E O-a(d) 2 =: rj a (n). 

d\n 


With this notation <i> a (N) > max„<jv \Jrfa(n), and the lower bounds follow from 
the maximal order of r) a (n). For ^ < a, < 1 this can be found easily. Since r] a (p ) = 
1 + p~ a + 2 P~ 2a for p prime, we find for n = 2.3 ... P (so that log n ~ P) that 


'?«(") = n 0+4+°(i))=“pk 1 +°( ! )) e 4} 

P <p ' 1 ' /j<p 1 ’ 

((1 +o(l))P 1 -“) ((1 + o(l))(log//) 1 -“) 

= exp <-: = exp <-> . 

( (1 — a) logP ) ( (1 — a) loglogn ) 

Now, if tk is the k th number of the form 2.3 ■ ■ ■ P (i.e., t k = p\ • • • Pk)-, then log tk ~ 
klogk logtyt+i- Hence for tk < N < lk +\, log N ~ k log k. It follows that 


®a(N) > sjri a (t k ) > exp 


(l+od^qogfjfc) 1 ^ ) 

2(1 -a) log log t k ) CXP 


(l+oUmiogiV) 1 -” ) 
2(1 — a) log log N ) 


For a = 1, we have to be a little subtler to obtain max„<N s/p\(n) — e v log log N + 
0(1). We omit the details, which can be found in [17]. 

For the case a — the above choice doesn’t give the correct order and we lose 
a power of log log N . Instead, we follow an idea of Soundararajan [31]. Let / be the 
multiplicative function supported on the squarefree numbers whose values at primes 
p is 

„ , ( (M)t/2 i for M < p < R. 

f(n) — l v P ’ lo §P - f - > 

| 0, otherwise. 

Here M — log A 7 (log log N ) as before and log R = (log M) 2 . 

Now take a n — f(n)F(N)~ 1 ^ 2 , where F(N ) = /( ,7 ) 2 so that 

Hn<N a n = 1- Then by Holder’s inequality 



F(N) 


N 

£ 

n =1 


/(») 

s/n 


E 

d\n 


1 f ( n ) 

F(N) 

n<N v 


E /(^) 2 - 

d < N/n 
(n,d) — \ 


(3.16) 
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Now using ‘Rankin’s trick’ 3 we have, for any /3 > 0 

/(«) 


EEE E /w 2 =E{^( E /w) 2 - E /«> 2 ) 

<N/n n<N d>l d>N/n ' 

,d)= 1 (n,rf) = l («,</)= 1 

= e £g(no+/« 2 )+o((^)' n o+//w 2 ))). 

n<W v "i- 77 


n 1 / 2 

n<N d 

(n,d)= 1 


/>!« 


P 1" 


(3.17) 


The O-term in (3.17) is at most a constant times 

l - Y f(n)n^ 1/2 Y\{l + p^f(p) 2 ) < T^n(l + P P f(P) 2 + P fi ~ 1,2 nP))' 


NP 

n<N p\n P 

while the main term in (3.17) is (using Rankin’s trick again) 


no+/« 2 +4fr)+no++/- /2 /o>>))- 

P ^ ' P ' 

Hence (3.16) implies 

(f>-f * /4(n( i+/w2+ ^w) 

no +//^) 2 + p^rp) 

The ratio of the O-term to the main term on the right is less than 
exp j-£ log N'T Y (P P ~ l )(f(P) 

which equals 

expj-yg logiV + Y (P^-l)( 

* A/f ^ ™ ^ D V 


M <p<R 


\2 , fiP]_ 

«l/2 


M 


+ 


M 1 ' 2 


M<p<R 


, P(logp ) 2 J5»(log/7) 


Take /J = (logM) 3 . The term involving M 1//2 is at most (log /V) 1 : ' 2+E for every 
s > 0, while the remaining terms in the exponent are (by the 


3 If c n > 0, then for any ft > 0, En>x c n < •* ^ EnE 1 . 
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prime number theorem in the form jt(.x) = li(jc) + 0(x( logx) A ) for all A) 


-/llog N + M [ R -—-dt + o 
J M t(\ogt) 3 

' R dt 


t»- 1 
(log/) 3 


log N 


= — ft log N + fM 


/ 

JM 


(log log N) a 


+ <9 B 2 M 


im t(\ogt ) 2 
log N log log log N 

~ — P-, 

log log N 

after some calculations. 

Finally, since F(N) < ]~[ (1 + f(p) 2 ), this implies 


f R dt \ 

JM t log t) 


®i/2(n) > x ~ n o 

M<p<R 

for all N sufficiently large. Hence 


+ 


f(p) 


P 1/2 (l + f{p) 2 ) 


)• 


log 4>i /2 (N)> M 1/2 £ 


logN \!/2 


M<p<R 


p(\og p) v log log N 


/ log tv \ 
' l02 l02 N ' 


as required. □ 

Remark. The result for \ < a < 1 is 

flog*) 1 "" < log A> a (N) < (1 + ^-D-) dog 

2(1 — a) log log N ° 2(1—a) loglogfV 

It would be nice to obtain an asymptotic formula for log (N). Indeed, it is possible 
to improve the lower bound at the cost of more work by using the method for the case 
a = but we have not been able to obtain the same upper and lower limits. 


4.2 Connections between 4>« (N) and the order of |£(a + it) | The lower 
bounds obtained for Oa ( N ) for ^ < a < 1 can be used to obtain information regard¬ 
ing the maximum order of £(s) on the line div = a.. 


Proposition 4.2. With h n = n d 01 a„/d we have, for any a, 


£ \bn \ 2 

n<N 


£ 

m,n<N 


a m a n {m,n) 2a 

m a n a 



1 

k?»' 


Proof. We have 


I b n | — b n b n 


1 

n 2a 


£ c 0l d 0l a c a ( i 

c\n,d\n 


i 

n 2a 


£ c a d 0l a c a ( i , 

[c,d]\n 
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since c\n, d\n if and only if [c, d]\n. Hence 


E \bn\ 2 = J2 c a d a a c a d £ ^ 

n<N c,d<N n<N,[c,d]\n 


E 

c,d<N 


c a d a a c ci d ^ 

[c.d¥“ , 


1 

5P«’ 


by writing n = [c, d ]A'. Since (c, d)[c.d] = cd, the result follows. 
It follows that if a n > 0 for all n and a, > k , then 


E i 6 »i 2 

n<N 


< 


Z(2ot) 


a m a n ( m.n) 2a 


m,n<N 


(mn)° 


□ 


(3.18) 


Theorem 4.3. Let | < a < 1 and let a e l 2 with \\a ^2 = 1- Let A^{t) = 
Hn=i a ntt u ■ Let N < T x , where 0 < A < |(a — A). Then for some t] > 0, 


1 

T 



\$(a+ it)\ 2 \A N (t)\ 2 dt = l;(2a) £ 

m,n<N 


a m a n (m,n) 2ci 

(mn) a 


+ 0(T~ V ). (3.19) 


Proof. We shall assume \ < a < 1, adjusting the proof for the case a = 1 after¬ 
wards. For a f 1, we can integrate from 0 to T since the error involved is at most 
0{N / T) = 0{T~ V ). 

Starting from the approximation £(a + it) = YL n <t n ait + 0(t~ a ), we have 


l£(« + it) | 2 


E 

n<t 


1 

YlCi+it 


2 

+ o(t'- 2a ). 


Let k, l e N such that (k, /) = 1. Let M = ma x{k, 1} < T. The above gives 


pT /k \ _ 1 ^ / k \ it 

i | f(“ + ")l 2 (j) d ' = l E^+77 ( 7 ) ‘lt + 0(T -°) 


The integral on the right is 
-T 


f E —(—)"n= T —f (—)“ 

Jo JtL, (mn) a v lit / ^ ( mn )a J mm{m n) \ In > 


dt. 


m,n<t x . ' m,n<T v 7 " maxfrn.n} 

The terms with Em = In (which implies m = rl, n = rk with r integral) contribute 

T-rM £(2a)„ „ / M 2 “- 1 T’ 2 “ 2 “' 


— Y- 

(E/)“ ^ 


r 2a 


r<T/M 


ikl) 


t) m /M T - "*\ 

“ + °( (E/)“ ) 
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The remaining terms contribute at most 
1 


^ £ 


(m«)"| log t^ 1 


m,n < T 
km 7^ In 


< 2M 2a Y 


1 


(km In ) 01 1 logoi' 


m,n < T 
km ^ In 


< 2M 2a Y 


1 


m\ < kT,ni < IT 
m\ 7^ n\ 


< 2 M 2a J2 


m\,n\ < MT 

mi ± m 


(m 1 « 1 )“ | log ^ | 

1 

(m 1 « 1 )“|log^| 


= 0(M la (MT) 2 ~ 2a \og(MT)) 
= 0(M 2 r 2 “ 2 “logD, 


using Lemma 7.2 from [33]. Hence 

£ \^a + it)\ 2 [j)“ dt = + 0(M 2 T 2 ~ 2a log T). 

It follows that for any positive integers mji < T, 


f T \Ha + d, = » 2 “><”’.") 2 ° r+0(max) ,„,„) 2 T 2-2« 

Jo \n) (777 77)“ 


l°g T). 


Thus, with Au(t) = J2n = l 


J |f(a + it)\ 2 \A N (t)\ 2 dt = Y a m a n J | £(a + 7 0[ 2 (^) dt 

® m,n<N ® 


= f(2a)r £ 


m,n<N 


a m a n (m,n ) 
(mn) a 


2 a 


2 “ log r ^ max{/n,«} 2 |a m a„|Y 
V m,n<N 2 


The sum in the 0-term is at most N 2 (%2 n<N \a n |) 2 < A 73 , using Cauchy-Schwarz. 
Hence 


1 

T 



tf(a + it)\ 2 \A N (t)\ 2 dt = S(2a) Y 

m,n<N 


a m a n (m,n) 2a 

(mn) a 


+ 



iV 3 log7\ 

J'la—l )' 


Since A 73 < 7" 32 and 3A < 2a — 1, the error term is <9(7 v ) for some r] > 0. 











108 


Titus Hilberdink 


If a — 1 we integrate from 1 to T instead and the (9-term above will contain an 
extra log T factor, but this is still 0(T~ V ). □ 

We note that with more care the TV 3 could be turned into an N 2 , so that we can 
take A < a — ^ in the theorem. This is however not too important for us. 


Corollary 4.4. Let 3 < a < 1. Then for every e > 0 and N sufficiently large, 

max | i;(a + it )| > ^(A^-s)-*) + O^N^) (3.20) 

t<N 


for some t] > 0. 


Proof Let a„ > 0 be such that ||a|| 2 = 1, and take N — with A < |(q; — 3). By 
(3.18) and (3.19), 


E i^i 2 ^ 

n<N 



+ it)\ 2 \A N (t)\ 2 dt + 0(7^) 


< 


max |£(o! + it)\ 2 ^- f \A N (t)\ 2 dt + 0(T n ) 

t<T T Jo 

maxima + it)\ 2 Y |a„| 2 (l + 0(N/T)) + 0(T “") 

t<T L —' 


n<N 


using the Montgomery and Vaughan mean value theorem. The implied constants in 
the O-terms depend only on T and not on the sequence {a n }. Taking the supremum 
over all such a , this gives 


<M^V) 2 


sup 

Mh=i 


Y \ b n\ 2 < max |£(<* + it)\ 2 + 0(T v ), 

Z —< t<T 

n<N 


for some rj > 0, and (3.20) follows. 

In particular, this gives the (known) lower bounds 

fc gogr) 1 -* ] 

log log T \ 


□ 


max \lf(a + it)\ > expj- 
t<T I 


for | < a < 1 and max^^ |£(1 + it)\ > e y log log T +0(1) (obtained by Levinson 
in [25]). 

Morever, we can say more about how often \f(a + it )| is as large as this. For 
Ael and c > 0, let 


Fa(T) = ji e [1, T] : |f(l + it)\ > ^ log log T-A [ 

( (cgogT) 1 -" 

F«,c(T) = \t e [0, T] : |£(a + 0)1 > exp 8 ’ 

( 1 log log T 


(3.21) 

(3.210 
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We outline the argument in the case ^ < a < 1. We have 

i>i 2 4(/ +/ 

n^N 1 y j Fo.r(r) J ro 


+ / \(a + it)\ 2 \A N (t)\ 2 dt + 0(r-"). 

>F a . c (T) J\0,T]\F a ,c(T) / 

(3.22) 

The second integral on the right is at most 


exp | 


12c (log 7) 


l—a 


log log T 


}'ff \ A N(t)\ 2 dt = C>(exp{ 


1 2c (log T) 


1 —a 


log log T 


while, by choosing a n — d(N ) 1 ' 2 for/?|A ? and zero otherwise, the LHS of (3.22) is 
at least rj a (N). Every interval , T A ] contains an N of the form 2.3.5 ... P. As 

such, ri a (N) > exp< 


C ?° S i r V —I f° r some c' > 0. Hence, for 2c < c', 

log log T \ 

If 0 9 [c' {\ogT) l ~ a \ 

- |C(Q! + it)\ 2 \A N (t)\ 2 dt > exp|——-—— 1. 

T JF a c (T) 1 2 log log T 1 


T 

We have |£(a + H)| = 0(T V ) for some v and |An(0| 2 < d(N) = 0{T E ), so 


H 


F a ,c(T) 


|+ it)\ 2 \A N (t)\ 2 dt < T 


2v— 1+e 


KFaAT))- 


Thus ii(F a ,c(T))>T 


1—2 v—£ 


In particular, since v < we have: 


for all c sufficiently small. 


Theorem 4.5. For all A sufficiently large (and positive), 

( l°g T 

H(Fa(T)) > T expj-o-—-—- 
( log log T 

for some a > 0, and for all c sufficiently small, lx{F a ^ c {T)) > T {1+2a d 3 for all 
sufficiently large T. Furthermore, under the Lindelof Hypothesis, the exponent can 
be replaced by 1 — e. 


Addendum-. There has been much recent activity in this field. Aistleitner [1], in 
essence restricting (*) to N rows and columns (rather than just the first N) used 
an ingenious idea based on GCD-sums to improve these f2-rcsults for |£(a + it)\ and 

Theorem 3.5 to exp for \ < ot < 1, thereby improving Montgomery’s 

result. More recently still. Bondarenko and Seip [4] used a similar method for the 
case a — j, obtaining 


Zi 

2 


(T) > exp jc 


I log T log log log T ] 
log log T 


for any c < 


l 

sffi' 
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4.3 Second measure - cp a on AT 2 and the function M„ ( T ) For a < 1, 

(p a is unbounded on l 2 and so <p a (l 2 ) f l 2 (by the closed graph theorem 4 - see [32], 
p. 183). However, if we restrict the domain to At 2 , the set of multiplicative elements 
of / 2 , we find that (p(A4 2 ) C l 2 . More generally, if / is multiplicative then, as we 
shall see, cpf{ At 2 ) C l 2 in many cases (and hence At 2 ) C At 2 ). 

Notation. Let At 2 and At 2 denote the subsets of l 2 of multiplicative and completely 
multiplicative functions, respectively. Further, write At 2 + for the non-negative mem¬ 
bers of At 2 , and similarly for At 2 +. 

Let Atg denote the set of At 2 functions / for which / * g E At 2 whenever 
g E At 2 ; that is, 


At 2 = {/ e At 2 : g e M 2 =► f *g e At 2 }. 

Thus for / e At 2 , (pf(M 2 ) C At 2 . 

It was shown in [ 1 8] that At 2 is not closed under Dirichlet convolution, so At q ^ 
At 2 . A criterion for / e At 2 to be in Atg was given, namely: 

Proposition 4.6. Let f E At 2 be such that Y^k= i I /( P k ) I converges for every prime 
p and that JlfeLi \f(P k )\ — A for some constant A independent of p. Then f e Atg. 

On the other hand, if f e At 2 with f > 0 and for some prime po, f(p^) 
decreases with k and Jf'kLi f(Po ) diverges, then f $ Atg. 

The proof is based on the following necessary and sufficient condition: Let f g € 
At 2 he non-negative. Then f * g E At 2 if and only if 

OO 

E E E f(P m Mp n )f(p m+k )g(p n+k ) converges. 

p m,n>lk=0 

This can be proven in a direct manner. 

Thus, in particular, At 2 C Atg. For / e At 2 if and only if \f(p)\ < 1 for all 
primes p and Y2 P \f(p)\ 2 < oo. Thus 


Ei/(/)i 

k— 1 


\f(p)\ 

i-\f(p)\ 


< A, 


independent of p. 

For example, («““) E Atg for a > 

The “quasi-norm” Mf(T). Let / E Atg. The discussion above shows that 
<Pf{ At 2 ) C At 2 but, typically, q>f is not ‘bounded’ on At 2 (if / f Z 1 ) in the sense 


4 Being a ‘matrix’ mapping, <p a is necessarily a closed operator, and so (Pad 2 ) C / 2 implies (p a is bounded. 
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that 5 ||^)yfl||/||a|| is not bounded by a constant for all a e AA 2 . A natural question 
is: how large can ||^/fl|| become as a function of ||a||? It therefore makes sense to 
define, for T > 1, 


M f (T) = 


sup 

a e M. 2 
\\a\\ = T 


\Wf(4 

HI 


We shall consider only the case f(n) — n~ a , although the result below can be ex¬ 
tended without any significant changes to / completely multiplicative and such that 
/|p is regularly varying of index —a with a > 1/2 in the sense that there exists 
a regularly varying function / (of index —a) with f(p) — f(p) for every prime p. 
We shall write Mf(T) — M a (T) in this case. 


Theorem 4.7. As T —> oo 


MAT) = (loglog T + log log log T + 2 log 2—1 + o(l)), (3.23) 

while for ^ < a. < 1, 


log M B (r) 


v (l-a) 2 “ 


+ o(l) 


(log T) l - a 
(log log T ) a ' 


(3.24) 


Here B{x , y) denotes the Beta function / 0 ' l x 1 (I — t) y 1 dt. 

Sketch of proof for the case ^ < a < 1 (for the full proof see [18]). We consider first 
upper bounds. The supremum occurs for a > 0, which we now assume. Write 
a = (a„), cp a a = b = (h n ). Define ot p and f> p for prime p by 


OO 

= and Pp 

k— 1 



k =1 


By the multiplicativity of a and h, T 2 = ||n|| 2 = n p (l + a p) and ll^ll 2 = 0/7(1 + 
f p ). Thus 



Now for k > 1 


k 

b p k = ^2 p~ ra a p k-r = a p k + p~ a b p k -\, 
r = 0 


whence 

b 2 pk = a 2 p k + 2p~ a a p kb pk -i + P~ 2a b 2 pk -i ■ 


5 Here, and throughout this section || • || = || • ||2 is the l 2 - norm. 
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k= 1 


Summing from k = 1 to oo and adding 1 to both sides gives 

oo 

l-\~Pp = l-\-oip-\-2p 01 y ' apkb pk-i -\- p 201 ft p). 

By Cauchy-Schwarz, 

OO , OO OO x 1/2 _ 

J2 apkb p k-l < f J2 a lk b lk -1 ) = Y “/>(! + Pp)> 

k=\ \fc=l k = 1 ' 


(3.25) 


k= 

so, on rearranging, 


(1 + /3p) — 


2P “ y/oi p ( 1 + Pp) ^ 1 + 


1 ~P 


-2a 


< 


1 — P 


-2a ' 


Completing the square we obtain 

{^ +P P -(1-^)2' 

The term on the left inside the square is non-negative for every p since I + p p > 


+ a. 


r, which is greater than 


for cr > Rearranging gives 


1 + Pi 


y i + a p i — p 2a \ p u 

Let y p = Taking the product over all primes p gives 


i + — 


1 + Otr 


(3.26) 


Note that 0 < < 1 and FL — l — = T 2 . The idea is to show now that the main 

— ' ‘ 1 1 —yp 

contribution to the above sum comes from the range p x log T log log T. 

Let e > 0 and put P = log T log log T. We split up the sum on the right-hand 
side of (3.26) into p < aP, aP < p < AP, and p > AP (for a small and A large). 
First, 


E <T.P~ 


a \- ap l-a 


< £ 


(log T) 


l—a 


p<aP 


p<aP 


(1 — a) log P (loglogT)“ 


(3.27) 
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for a sufficiently small. Next, using the fact that log T 2 = log ]~[ —y > ^ y 2 
. p t ~yp ” ^ 


we have 


H p a Yp - ( E p 2 “ H vl) ~ ( 

n-,AP ' V 


p>AP 


y p>AP p>AP 

(log F) 1 -” (log log T)~ a 

A a ~ x l 2 ^/a — 1/2 


'2A l - 2a P l - 2a \ogT\ 
(2a — 1 ) log P / 


1/2 


< e 


(logT) 


1 —a 


(log log T)° 


(3.28) 


for A sufficiently large. This leaves the range a P < p < AP and the problem 
therefore reduces to maximising 

Yp 


E yp 
P a 

aP<p<AP 1 

subject to 0 < y p < 1 and J~[ —— = T 2 . The maximum clearly occurs for y p 

decreasing (if y p / > y p for primes p < p', then the sum increases in value if we 
swap y p and y p >)- Thus we may assume that y p is decreasing. 

By interpolation we may write Yp = g( 77 ), where g\ (0, 00 ) —> (0, 1) is con¬ 
tinuously differentiable and decreasing. Of course, g will depend on P. Let h — 
log pTg! > which is also decreasing. Note that 

2logT = T, h (j) >- E > h(a)n(aP) > cah(a)\ogT, 

P P<aP 

for P sufficiently large, for some constant c > 0. Thus h(a) < C a (independently of 

Now, for F: (0, 00 ) —>• [0, 00 ) decreasing, it follows from the Prime Number 
Theorem in the form n(x) = li(x) + 0( y^ 2 ) that 




(3.29) 


ax<p<bx 

where the implied constant is independent of F (and x). Thus 

21 ° gr - E h (j)~^l "~ ,osT [ 


aP<p<AP 

rOC 


h. 


Since a and A are arbitrary, f 0 h must exist and is at most 2. Also, by (3.29), 

P 1_ “ f A g(u) 

pa =— / Afl — II — I - 77 I 


y yp = A_ y g(£-)(£-) “ ~ — f 

^ AD P a log P J a 


du. 


u u 


aP<p<AP r aP<p<AP 

As a, A are arbitrary, it follows from above and (3.26), (3.27), (3.28) that 


log I, I, < 


SM Au + o ,l,t (togr "- 


0 u 


(log log T)° 
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Thus we need to maximise / 0 °° g(u)u~ a du subject to / Q °° h <2 over all decreasing 
g: (0, oo) —> (0, 1). Since h is decreasing, one finds that xh(x ) —> 0 as x —» oo and 
as x -> 0 + . 

For the supremum, we can consider only those g (and h) which are continuously 
differentiable and strictly decreasing, since we can approximate arbitrarily closely by 
such functions. On writing g = s o h, where s(x) = \/l — e~ x , we have 


r*w d „ = \ s( “ } “ ~T --Lffty 

Jo 11 " 1 - or Jo 1 - a Jo 

i r°° 

= -/ s'(h(u))h'(u)u 1 ~ a du 

l-a Jo 

i rh( 0+) 

= - / s'(x)/(x) 1 ~ a dx, 

1 - a Jo 


: du 


where l = h 1 , since spug ( u ) —> 0 as u —> oo. The final integral is, by Holder’s 
inequality, at most 



) a / n h{ 0+) \ l-o 

a ') ■ 


(3.30) 


But Jq (0 * / = — / 0 °° uh'(u)du = / 0 °° h < 2, so 


f 


g(u) 


du < 



A direct calculation shows that 6 1 — ^). This gives the 

upper bound. 

The proof of the upper bound leads to the optimal choice for g and the lower 
bound. We note that we have equality in (3.30) if I/ (s') 1 /" is constant, i.e., l(x) — 
cs\x) l ' a for some constant c > 0 — chosen so that / 0 °° l = 2. This means that we 
take 

= < s T‘(g)“) = '°sQ + 

from which we can calculate g. In fact, the required lower bound can be found by 
taking a„ completely multiplicative, with a p for p prime defined by 


a P = go 



where P — log T log log T and go is the function 


go(x) = 


1 - 


N 


2 

1 + V 1 + (f ) 2 “ ’ 


6 The integral is 2“ x /“ / 0 °° e~ x/a (l - e- x )~ l/2a dx = 2“ 1 /“/ 0 1 1 1 /““ 1 (1 - t)~ x l 2a dt. 
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with c = 2 1 + 1 /“/5(i, 1 — 2^). As such, by the same methods as before, we have 
||a|| = 7’ 1+ °( 1 ) and 


log 


lllcHI 

HI 


««,(£) + 0(1) 



goH 

U a 


dit. 


By the choice of go, the integral on the right is 


(1 - at) 2“ 


, as required. 


□ 


Remark. These asymptotic formulae bear a strong resemblance to the (conjectured) 
maximal order of | £ (cr + i T) \. It is interesting to note that the bounds found here are 
just larger than what is known about the lower bounds for Z a {T) (see the interlude 
on upper and lower bounds on £(s), especially items (b) and (c)). We note that the 
constant appearing in (3.24) is not Lamzouri’s C(o') since, for a near the former is 
roughly ,— while C(a) 




Lower bounds for <p a and some further speculations. 

of (p a via the function 


m a (T) = 


inf 

a e JA 2 - 
Hall = 7- 


\W a q\\ 

HI 


We can study lower bounds 


Using very similar techniques, one obtains results analogous to Theorem 4.7: 
1 6e y 

—— = —y (log I 0 ® T + log log log T + 2 log 2 — 1 + o(l)) 
mi(I) tc z 


and 


1 

m a (T ) (1 -ot)2 “(loglogr)“ 


for 4 < a < 1. 


We see that m a (T) corresponds closely to the conjectured minimal order of |£(a + 
iT)\ (see [13] and [27]). 

The above formulae suggest that the supremum (respectively infimum) of 
||(Pq.cz||/ 1|a|| with a e Ad 2 and ||cz|| = T are close to the supremum (resp. infi¬ 
mum) of |£ a | on [1, T], One could therefore speculate further that there is a close 
connection between ||^ a a||/||cz|| (for such a) and |£(a + iT)\. 

Heuristic ally, we could argue as follows. Consider 


1 

T 



\^(a — it)\ 2 



n—\ 


2 

dt. 


(3.31) 


This is less than 


Z a (T) 2 ■ — 


1 r T 00 

•-/ J^a n n lt 

1 J ° n = 1 


dt ~ Z a (T) 2 \\a\\ 2 , 
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by the Montgomery-Vaughan mean value theorem (under appropriate conditions). 
On the other hand, (3.31) is expected to be approximately 


1 

T 



OO 

dt I 2 = H < aH| 2 - 

n — \ 


Putting these together gives 


\Wcta\\ 

HI 


< Z a (T). 


The left-hand side, as a function of ||a||, can be made as large as F( ||fl||), where 
F(x) = exp {c a Qogiogx) a ^ the above continues to hold for ||a|| as large as T , then 
M a (T) < Z a (T) would follow. Even if it holds for ||a|| as large as a smaller power 
of T, one would recover Montgomery’s 0-result. 

Alternatively, considering (3.31) over [T, 2T], 


1 

T 



\^(a — it)\ 2 


T, a n n lt 


n =1 


2 

dt = \^(a + it 0 )\ 2 


| r2T °° 

— / a n n l 


dt (3.32) 


for some to e [7', 2 T] and, using the Montgomery-Vaughan mean value theorem 
(assuming it applies), this is approximately 

OO 

\i;(a + it 0 )\ 2 y\a n \ 2 = [t(a + /i 0 )| 2 H| 2 . 

n — \ 


On the other hand, formally multiplying out the integrand, by writing (,{ct — it) = 
Yy=\ and formally multiplying out the integrand, the left-hand side of (3.32) 
becomes (heuristically) 

2 oo 

dt \b n \ 2 = IkcHI 2 - 

n — \ 

Equating these gives 

—— % £(or + it 0 )\. 

I|a|| 

Clearly there are a number of problems with this. For a start, we need (p a a e 
l 2 . More importantly, the error term in the Montgomery-Vaughan theorem contains 
Yy=\ n HI 2 , which may diverge. Also, a n and hence |a|| may depend on T, and 
finally, the series for £(a — it) doesn’t converge for a < 1. 

If a n — 0 for n > N. the above argument can be made to work, even for N 
a (small) power of T (see for example [17]), but difficulties arise for larger powers 
of T. 

There seem to be some reasons to believe that the error from the Montgomery- 
Vaughan theorem should be much smaller when considering products. These occur 
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when a n is multiplicative. For example (with Q = \\ p< p P so that log Q = 6(P) ~ 
P by the Prime Number Theorem), 


1 

2 T 



+ P U ) 



The ‘main term’ is d{Q) = 2 n ^ p K while the error is at least jr- However the left- 
hand side is trivially at most 4*^', so the error dominates the other terms if P > 
(1+5) log T. If, say, P is of order log T log log T (which is the range of interest), 
then n(P ) x log T, so 2 3r t /> ) i s like a power of T, but 0 is roughly like ragtag r _ 
far too large. 

Thus it may be that for a n completely multiplicative, it holds that 


-2 T 


y J i^o-tor 


n I „ a v u dt ~ l?(“ + n Y~\a 

p<p 1 a p p p<p 1 1a ' 


for P up to c log T log log T. This suggests the following might be true: 
(a) given a e A 2 with ||a|| = T, there exists t e [T, 2 T] such that 


IIjgotgjl 

Ml 


« 1+ it )|. 


(b) given T > 1, there exists a e A4 2 with ||a|| = T such that 

\W«a\\ 

—— % \£(a + iT)\. 

Ml 

Here, % means something like log-asymptotic, ~, or possibly even =. Thus (a) 
implies M a {T) < Z a (T), while (b) implies the opposite. Together they would imply 
we can encode real numbers into A 2 -functions with equal / 2 -norm, such that <p a has 
a similar action as t, a . 


Closure of AiB 2 ? We finish these speculations with a final plausible conjecture 
regarding the closure under multiplication of functions in B 2 with multiplicative co¬ 
efficients. 

Let denote the subset At B 2 of functions / for which / e AIq. Recall 

that AIq is the subset of At 2 for which g e A 2 A f * g e Ai 2 . This suggests the 
following conjecture: 

Conjecture. Let / e A B 2 and g e AqB 2 . Then fg e AiB 2 . 

In particular, A B 2 A C B 2 = Ai B 2 . Since ij a e A„B 2 forcr >b this would 
imply e A B 2 for every k e Nanda > which implies the Lindelof hypothesis. 
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5 Connections to matrices of the form 


(/(07(i,y) 2 ))/j<jv 


The asymptotic formulae for ( N) in Theorem 4.1 can be used to obtain informa¬ 
tion on the largest eigenvalue of certain arithmetical matrices. Various authors have 
discussed asymptotic estimates of eigenvalues and determinants of arithmetical ma¬ 
trices (see for example [6], [7], [26] to name just a few). 

Let Ajy(f) denote the N x N matrix with ij th -entry /(// /) if j \i and zero other¬ 
wise. As noted in the introduction, these matrices behave much like Dirichlet series 
with coefficients /(«); namely, 


4w(/Miv(g) = A N (f * g). 


In particular, A yy (/) is invertible if / has a Dirichlet inverse, i.e., /(1) ^ 0, in which 


case A N (f) 1 = A N (f 1 ). 


Suppose for simplicity that / is a real arithmetical function. (For complex values 
we can easily adjust.) Let 



where b n = Yld\ n f(d) a n/d ■ Observe that O f(N ) 2 is the largest eigenvalue of the 


matrix 


An(/) T A N (f). 


Indeed, we have 



so that, on noting /, j \n if and only if [i, j]\n (where [/, j] denotes the 1cm of i and j) 


N 


N 



where (using (/,./)[/, /] = ij ) 



ki \ / kj 


But is also the i y' th -entry of A n(J) T d N (./), as an easy calculation shows. Thus 


<$>f(N) 2 = sup y~] b^ciiaj 

. ? , . . > 


a \H- ^ a N~ 1 iJ<N 


(3.33) 
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is the largest eigenvalue of Au{f) T i.e., Oy (V) the largest singular value 1 
of An (f). Thus an equivalent formulation of Corollary 3.2 for / supported on N is: 
For f e Z 1 non-negative, the largest singular value of A^if) tends to ||/||i. 

Now if / is completely multiplicative, then 


b) 


(N) 


= E 


k<~N— 
— IT7T 


which for large N is roughly ||/|||/( ^.^ 2 ) for / e l 2 . This suggests that the matrix 
(/( y IJ j-) 2 ))j j<s/ ^ as ^ ts l ar § est eigenvalue close to Oy(A’) 2 /||/|||. This is indeed 
the case. 


Corollary 5.1. Let f e l 2 be non-negative and completely multiplicative. Let A ,y 
denote the largest eigenvalue of (f( ^ lJ .^ 2 )) t j <N ■ Then 

Q/(ZV 3 ) 2 

ll/ll! " N ~Ek=im 2 ' 


In particular, for f e / 1 , 


lim An — 

N^-oo 


2 
J_ 
2 ’ 
2 


Proof We have 

= 2 su p 2 E 

a l+'"+ a A? = 1 i,j<N y ' I > 

When / > 0, the supremums in (3.33) and (3.34) are reached for a„ > 0. Thus, 


(3.34) 


d>/(ZV ) 2 < 


\A N 


follows immediately. 

On the other hand, for i , j < iV, [/, j] < N 2 so 



Taking the supremum over all such a n gives, f{N 2 ) 2 > An J2k<N f(k) 2 > as 
required. 

Finally, if f € /^then $f(N) —> ||/||i and so An —> I4I4- follows. □ 

11/ II 2 


7 The singular values of a matrix A are the square roots of the eigenvalues of A T A (or A* A if A has 
complex entries). 
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The approximate formulae for d> a ( N ) in Theorem 4.1 lead to: 

Corollary 5.2. Let f{n) = n~ a and let An (a) denote the largest eigenvalue of 

Tt “" 


Ajv (1> = —7(e r log log N + Oil)) 2 , 


71 * 


log A N (a) 


log Ajv (^) 


(log AO 


1—a 


log log N 

I log N 
y log log N 


for \ < a < 1, 
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1 Introduction 

In comparison with geometry, number theory, analysis, or even algebra and topology, 
graph theory is a rather young mathematical discipline. It has undergone crucial 
developments with important theoretical advances as well as closer connections to 
other mathematical fields and applications in extra-mathematical areas only within 
the last 80 years. The term graph was introduced by Sylvester in 1878 in an article 
published in “Nature”, the by far most influential scientific magazine worldwide. 

By now graph theory is without any doubt an important mathematical discipline 
in its own right with prestigious topics, results and open questions, as for instance 
the four color problem, just to name the most famous one. At the same time, it 
is influential for many other mathematical fields, not only in discrete mathematics, 
but also for areas such as algebra, topology and even probability theory. Naturally, 
the development of graph theory benefited from the rapid progress in computers and 
computer science in the 20th century. 

However, graph theory has yet another facet to it, namely it may serve as a math¬ 
ematical “playground” in the following sense. Quite a few mathematical concepts in 
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other fields, as for instance topological items or the variety of zeta functions, related 
to number theory, algebra, differential geometry, or other subjects, have a counterpart 
in the world of (finite) graphs. Looking at and experimenting with these somewhat 
discrete relatives sometimes helps to better understand the original objects. 

Algebraic graph theory - nomen est omen - connects algebra and graph the¬ 
ory. Cayley graphs, encoding structural features of groups in correlated graphs, are 
a prominent example studied in this rather young mathematical field. Besides the nec¬ 
essary basics of algebraic graph theory, these lecture notes will present three topics 
connecting graph theory and arithmetic. 


2 Some (algebraic) graph theory 

Here we give a short introduction to the concepts of graph theory that will be used 
in the sequel. Besides some standard basics like cycle graphs, trees, regular graphs, 
spanning subgraphs, etc., which can be found in almost any introductory book on 
graph theory (e.g. [12]) and are mainly included to set up the notation, the reader will 
find more specialized results on circulant graphs and Cayley graphs. The introduction 
to algebraic graph theory deals with the adjacency matrix and the spectrum of graphs, 
in particular the spectrum of circulant and integral circulant graphs. 

2.1 Basics and some specialities on graphs 

Definition 2.1. 

• A (finite ) graph Q = ( V, E) consists of a (finite) non-empty set V of vertices 
and a set E c V x V of edges. The order of Q is the number of vertices | V | in 

G. 

We usually write tqtq rather than (tq, iq), and for tqtq e E we say that tq 
and tq are adjacent or neighbors in Q, written iq ~ iq. 

• The graph Q is called undirected if E is symmetric, i.e., if iq ~ iq tq ~ 
iq for all v 1 , tq e V, and otherwise directed. For undirected graphs we shall 
not distinguish between iqiq and tqu 1 (here tqtq represents a set rather than 
an ordered pair). 

• Edges of type vv e E are called loops. A loopfree graph has no loops. 

For several applications it is useful to allow graphs with multiple edges between 
vertices, so-called multigraphs. We call graphs without loops or multiple edges sim¬ 
ple. 

Assumption. Unless explicitly stated otherwise, the graphs Q = (V. E) we consider 
are always simple, finite, and undirected. 
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Example 2.2. Let n be a positive integer. 

(i) The complete graph K, n = (V, E) of order n has V\ = n and E = V x V \ 
{vv : ve V}. Clearly, \E\ = 

(ii) The cycle graph C n — (V, E) of order n has V = {ui, v 2 ,..., v n }, say, and 
E = {ViV2,V 2 V 3 ,...,V n -iV n ,V n V 1 }. 

(iii) The graph (Zio, E{ 13 , 7 , 9 }) with Zio := Z/10Z (cf. Fig. 4.2) is dehned by 

E{ C ] ,...,crl \ a b e ZtoxZto : a—b = Cj mod 10 for some j e { 1 ,..., r}}. 

It is an example of a circulant graph (see Example 2.19), as well as of a Cayley 
graph (see Example 2.12 (iii)). 

The structure of a graph does not depend on how the vertices are named. We 
already made use of this in the preceding examples by speaking of the complete 
graph and the cycle graph of order n, respectively, without specifying the elements of 
V. Accordingly, two graphs Q\ = (V \, E\) and Q 2 — (V 2 , E 2 ) are called isomorphic, 
written Q\ ~ Q 2 , if there is a bijective map <p : V\ —> V 2 such that 1 ; 1 ~ v 2 in Q\ 
if and only if <p(ui) ~ <p{v 2 ) in Q 2 . From now on, we shall not distinguish between 
isomorphic graphs. 

Definition 2.3. Let Q = ( V, E) and Q' = (V', E') be graphs with V' C V. 

• Q' is called a subgraph of Q if E' c {V’ x V’) H E. 

• In the special case where E' — {V x V’) Cl E, we say that Q’ is induced by its 
vertex set V'. 

• A subgraph Q' of Q is called a spanning subgraph if V' = V (cf. Fig. 4.3). 
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Figure 4.2. Circulant Cayley graph (Ziq, l, 3 , 7 , 9 }) 



Figure 4.3. Graph with (disconnected) spanning subgraph (solid lines) 










4 Arithmetical Topics in Algebraic Graph Theory 


127 


An important type of subgraph in a given graph is a clique , i.e., a subgraph which 
is a complete graph. 

Definition 2.4. Let Q = ( V, E) be a (directed) graph. 

• A (directed) path or walk from u e V to v e V of length A: in is a sequence of 
k + 1 vertices u = up, Vi, V 2 , ■. ■, fjt-i, = t e F such that u ;_\Vi e E for 
i — 1,2 ,... ,k (or, equivalently, the sequence of the corresponding k edges). 
The path is called simple if v t ^ Vj for i = 1,2,..., k — 1 and all j. For 
a path P — (up, V\,V 2 , ■ ■ ■, Vk- 1 , Vk), its length is denoted by v(P) k. 

• A (directed) cycle or closed walk in Q is a (directed) path from v to v for some 
v e V. 

• If C = (up, Vi,..., Vk- 1 , Vo) is a cycle in Q , then 

C n := ( up, ui ,..., i, tip, i>i,..., Vk— i , ,vo,vi,...,Vk-i ,vo) 

n times 


is called the /7-th power of C. 

Definition 2.5. Let Q = (V, E) be a graph. 

• Q is called connected if there is a path from u to v for any vertices u ^ v in Q; 
otherwise Q is called disconnected. 

• If ^7 is disconnected, its induced connected subgraphs Q, — (Vi, Ei) with pair¬ 
wise disjoint Vj C V satisfying (J Lj = V are called the components of Q. 

All graphs in Example 2.2 are connected and contain cycles. 

Example 2.6. The graph (Zip, £{ 2 , 8 }) is disconnected with two components. 

If a graph Q contains a cycle, we call Q cyclic, otherwise we call it acyclic. We 

introduce the very important subclass of graphs which are connected, but acyclic. 

Definition 2.7. A tree is a connected acyclic graph (cf. Fig. 4.5). 

Exercise 2.8. Show that the following assertions are equivalent for a graph T — 

{V,E): 

(i) T is a tree. 

(ii) There is a unique path in T between any two distinct vertices of T. 

(iii) T is maximally acyclic, i.e., T is acyclic, but T' := (V, E U {e}) is cyclic for 
all ce(FxF)\ E. 

(iv) T is minimally connected, i.e., T is connected, but T" (V, E \ {e}) is 
disconnected for every e e E. This means that \E\ — \ V\ — 
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Proposition 2.9. Every connected graph Q contains a spanning tree, which by defi¬ 
nition means that there is a spanning subgraph of Q which is a tree. 

Proof. Q contains a connected spanning subgraph, e.g., Q itself. Now assume that Q' 
is a connected spanning subgraph of Q with a minimal number of edges. Then the 
equivalence of (i) and (iv) in Exercise 2.8 tells us that Q' is a tree. □ 

Definition 2.10. Let Q — (V, E) be a graph. 

• The degree of v e V is d(v) \{u e V : u ~ u}|, i.e., the number of 
neighbors of v. 

• A vertex of degree 1 is called a leaf. 

• If all vertices in V have the same degree (let’s say k), then Q is called regular 
or, more precisely, k-regular. 

Exercise 2.11. Every tree with more than two vertices has at least two leaves. 
Example 2.12. 

(i) Each cycle graph C n is 2-regular. 

(ii) The complete graph K, n is (n — l)-regular. 

(iii) The circulant graph (Zio, E) from Example 2.2 is 4-regular. 
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Figure 4.5. Tree 


Definition 2.13. Let G be a finite group and S C G. We usually assume that S is 
a generating set of G and that S is symmetric, i.e., seS implies s~ 1 e S. 

(i) Then the corresponding Cayley graph Cay (G, 5) = (V, E) is defined by V : = 
G and E := {(g, gs) : g e G, s e S}. 

(ii) A Cayley graph Cay(Z„,Z*) is called unitary, where Z„ := Z//?Z is the 
additive group of residues mod n for some positive integer n and Z* = {1 < 
a < n : (a,n) = 1 } is the multiplicative unit group in the residue class ring 

^n- 

Example 2.14. Clearly, (Zio, £{ 1 , 3 , 7 , 9 }) = Cay(Z 10 , {±1, ±3}) (cf. Example 2.2 

(iii)) is a unitary Cayley graph, and (Zjo, £{ 2 , 8 }) — Cay(Zio, {±2}) (cf. Example 
2.6) is a disconnected Cayley graph. 

Exercise 2.15. Show that 

(i) Cay(G, S ) is connected if and only if S is a generating set. 

(ii) Cay(G, S) is undirected if and only if S is symmetric. 

(iii) Cay(G, S ) is regular of degree |S|. 

Definition 2.16. A graph Q — ( V, E) is called bipartite if V is the union of two 
disjoint subsets Vi and V 2 such that E c {V\ x V 2 ) U (V 2 x V\) (cf. Fig. 4.6). 

Exercise 2.17. 

(i) Show that a bipartite graph cannot contain a cycle of odd length. 

(ii) Prove that bipartite graphs are characterized by the property in (i). 

Hint: Assume the graph is connected and consider a spanning tree. 
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Figure 4.6. A bipartite graph 


2.2 Some algebraic graph theory Much more than what we need from alge¬ 
braic graph theory can be found in [7] of [14]. A tool which uniquely determines 
a graph and, more importantly, enables us to apply (linear) algebra to examine the 
graph and its properties is the adjacency matrix. 


Definition 2.18. Let Q = (V, E ) be a graph with V — { v i ,..., v„ }, which may be 
directed and is allowed to have loops. The adjacency matrix Ag = (aij) nxn of Q is 
defined by setting 


a ij 


0, if Vi'/'Vj, 

1, if Vi ~ vj for i ± j, 

2, if Vi ~ Vi for i = j. 


Example 2.19. Consider the Cayley graph Cay(Zio, {±1, ±3}) with Zio — 
{0,1,2,..., 9} from Example 2.2 (iii). Then 


^Cay(Zio,{±l,±3» 


/0 1 0 
1 0 1 
0 1 0 
1 0 1 
0 1 0 
0 0 1 
0 0 0 
1 0 0 
0 1 0 
\l 0 1 


10 0 0 
0 10 0 
10 10 
0 10 1 
10 10 
0 10 1 
10 10 
0 10 1 
0 0 10 
0 0 0 1 


1 0 1 \ 
0 1 0 
0 0 1 
0 0 0 
1 0 0 
0 1 0 
1 0 1 
0 1 0 
1 0 1 
0 1 0 


Exercise 2.20. Let Q be an undirected graph, hence Ag is a real symmetric matrix. 
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Figure 4.7. Two non-isomorphic cospectral graphs 


(i) Let x and y be eigenvectors of Ag corresponding to different eigenvalues. 
Show that x and y are orthogonal. 

(ii) Show that all eigenvalues of Ag are real. 

(iii) Verify that all rational eigenvalues of Ag are integers. 

Definition 2.21. Let Q = (V, E) be a graph. The eigenvalues ofQ are the eigenval¬ 
ues of Ag. We call the multiset (= set with repetitions) of all eigenvalues of G the 
spectrum Specif/) := Spcc(dg) of Q (where Specif/) c M by Exercise 2.20). 

We have seen in Section 2.1 that renaming the vertices of a graph Q\ gives an 
isomorphic graph f/ 2 . Usually Ag 1 f Ag 2 , but we have 

Proposition 2.22. If two graphs G\ and Q 2 are isomorphic, then Spec(f/j) = 
Spec(f/ 2 ). 

Proof. Isomorphic graphs have the same number of vertices, thus assume that the 
sets V\ and V 2 of vertices of Q\ and f/ 2 , respectively, are V\ — {v i,..., v n } and 
V 2 — {ui, ..., u n }. Since Q\ ~ f/ 2 , we have u\ = v a Q) for / = 1,..., n with some 
permutation a on {1,2,..., n}. If P a denotes the n x n permutation matrix of cr, 
we have P^Ag^ P a — Ag 2 . So Ag x and Ag 2 are similar matrices, and therefore they 
have the same characteristic polynomial, hence the same eigenvalues. □ 

Exercise 2.23. 

(i) Show that Spec(f/i) = Spec(f/ 2 ) (therefore G\ and f/ 2 are called cospectral, or 
isospectral), but Q\ f Q 2 for the two graphs Q\ and f/ 2 in Fig. 4.7. This means 
that the spectrum of a graph does not determine the structure of the graph. In 
other words, “you cannot hear the shape of a graph”, varying a famous question 
of Marc Kac posed in 1966. 

In particular, the graph on the right obviously is planar, i.e., it can be embedded 
in the plane, while the graph on the left is not planar. 
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(ii) What was the famous question of Marc Kac referred to in (i)? 

(iii) Look at some examples to find out that isomorphic graphs usually do not have 
the same eigenspaces! 

A nice property of the adjacency matrix and its powers for a directed graph is 
given by the following result, which is proved by straightforward induction. 

Proposition 2.24. IfQ = ( V, E) is a directed graph, then the number of (simple and 
nonsimple) directed paths of length l from u E V to v E V is (Ag) uv . 

Exercise 2.25. Let Q be a graph (loopfree and undirected) with e edges and t triangles. 

(i) Show that X^AeSpec(g) k = ti'/lg = 0, where tr denotes the trace of a matrix. 

The sum of the absolute values of the eigenvalues S(Q) := XasSpec(cj) I'M * s 
called the energy of Q. 

(ii) Use Proposition 2.24 to prove that tr Ag — 2e and tr Ag = 6 1. 

(iii) What about the corresponding problem for quadrilaterals? 

The spectrum of a graph reflects quite a few of its properties. We are particularly 
interested in regular graphs. 

Proposition 2.26. Let Q be a k-regular graph. 

(i) Then k E Spec(t/), and |A| < k for all A E Spec((/). 

(ii) IfQ is connected, then k has multiplicity 1 in Spcc(ty). 

(iii) IfQ is connected, then —k E Spec(t/) if and only ifQ is bipartite. 

Proof Let Q — ( V. E) with V = {iq,..., v n }. Since Q is A-regular, every row of 
Ag contains k entries 1 and n — k entries 0. Hence 


( l \ 


/ k \ 



1 


k 

— k . 

1 

\V 


\k) 

— A. 



i.e., k E Spec((/). 

Let A E Spcc(f/) be arbitrary, hence Agx — Ax for some x = (xq,..., x„) f 0. 
Assume that \x m \ is the largest among all |x; |. Then 


|Ax m | — |(/lgx) m | — ^ ^ 


< 


k\x m \, 


(4.1) 


Vi~V,„ 


thus |A [ < k, which proves (i). 

To verify (ii), it remains to show that k has multiplicity 1 in Spcc(Q). As¬ 
sume that Agx = kx for some x f 0. Let again \x m \ be the largest and w.l.o.g. 
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x m > 0 (otherwise consider —x). It follows from (4.1) (without absolute values) that 
12v ~v„, x ' ~ l <x m- Since v m has k neighbors, the sum has k terms and thus all 
corresponding summands x* must be equal to x m . Since Q is connected, iteration of 
the argument finally yields x, = x m for all i. This means that (I. I, 1/ spans the 
eigenspace corresponding to A = k. 

We are left with (iii). First assume that Agx — —kx for some x jk (). Again 
we may assume w.l.o.g. that some x m > 0 has maximal \x m \. Moreover, we may 
also assume that x, > 0 for 1 < i < s and x,- < 0 for / + I < i < n, while 
Xs+i = x s +2 = ... = Xt = 0. As in (4.1) we conclude that 

s n n 

-kx m = y xt + y x, > y x; > -kx m . 

i = 1 i=t +1 i=t+ 1 

Vi Vi~Vm Vi~Vm 


This means we have in fact equality everywhere, hence u,- •/ v m for 1 < i < s 
and x,' = —x m for all ry ~ v m . Iterating this argument, each neighbor Vj, say, of 
a neighbor Vi of v m has again xj — —x,- = x m . Since Q is connected, V splits into 
two sets, V\ {Vi : x,- = x m } and V 2 := {vt : x ; = — m}, such that u / v for any 
u,veV i or any u,v e V 2 . This means that Q is bipartite. 

To prove the opposite direction of (iii) we observe that after reordering the vertices 
a connected bipartite graph Q typically has an adjacency matrix of type 


Ag 


/0 0 0 0 0 0 * 

0 0 0 0 0 0 * 

0 0 0 0 0 0 * 

0 0 0 0 0 0 * 

0 0 0 0 0 0 * 

0 0 0 0 0 0 * 

****** 0 
****** 0 
****** 0 
V* ***** o 


* * 
* * 
* * 
* * 
* * 
* * 
0 0 
0 0 
0 0 
0 0 


*^ 

* 

* 

* 

* 

* 

0 

0 

0 

0 / 


where the entries in the * positions are 0 or 1. Now it is easy to see that 


Ag * (1,..., 1, — 1,..., — l) f = (~k, _ —k,k, _ k)‘ 

= (-k)d .i,-i.-i y, 


i.e., —k e Spec(^). 


□ 


Beyond the results of Proposition 2.26 the following facts are well known: 

• A /c-rcgular graph has as many components as the multiplicity of its eigenvalue 
k. 

• Spec((?) is symmetric about the origin, i.e., the multiplicities of A and —A 
in Spec(tJ) are the same, if and only if Q is bipartite (this uses the Perron- 
Frobenius Theorem: see e.g. [14], Theorem 8.8.1 on the eigenvalues and eigen¬ 
vectors of nonnegative matrices; also cf. Theorem 3.24). 
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Exercise 2.27. We know that k e Spec(C/) for a A:-regular graph Q. Show that Q 
necessarily is connected if k has multiplicity 1 in Spec((?). 

Definition 2.28. A graph Q is called circulant if Ag is a circulant matrix (briefly: 
a circulant, cf. [11]), i.e. a square matrix whose rows are obtained by cylic right shifts 
of the first row. 

Example 2.29. Clearly, Cay(Zio, {±1, ±3}), the cycle graphs C„ and the complete 
graphs IC n are examples of circulant graphs. 

Proposition 2.30 (Spectrum of C n ). Let n be a positive integer and (Oj exp 
for j = 0,1,..., n — 1. We have 

(i) Spec(C„) = {a>j + -±- \ j =0,1,...,/? — 1}. 

(ii) The largest eigenvalue of C n is 2 (with multiplicity 1), and the second largest 

is 2 cos (with multiplicity 2). The smallest eigenvalue is —2 (with mul¬ 

tiplicity 1 )for even n, and it is —2 cos ( ”~ 1)7r (with multiplicity 2) for odd n. 

(iii) C n is bipartite if and only ifn is even. 

Proof A helpful observation is that Ac n = P + P ] , where P is the permutation 
matrix of the permutation defined by a cyclic left shift. If o> is any n-th root of unity, 
then it is obvious that the vector (1 ,co, co 2 ,..., to n ~ l y is an eigenvector of P with 
eigenvalue co as well as an eigenvector of P ~ 1 with eigenvalue A. Since there are 

exactly n n -th roots of unity, namely co = coj for j — 0, I./; — I, identity (i) 

follows. 

By (i), the largest eigenvalue is coo + ^ = 2 (with multiplicity 1), and the 
second largest is 2 cos ^ n ~^ 7T = 0 )\ + = co n -\ + M 1 ^ (with multiplicity 2). 

The smallest eigenvalue is —2 (with multiplicity 1) for even n, and it is 2 cos ^”~ 1)7r 
(with multiplicity 2) for odd n. 

(iii) follows from (ii) and Proposition 2.26 (iii). □ 

Arguing as in the preceding Proposition 2.30 one can show 

Proposition 2.31. If A is a circulant n x n matrix with first row ao , a t,..., a n -i, say, 
then the eigenvalues Ay are given by 

n— 1 

X i '■ =z '£2 a krf O'=0,1.n-1), 

k—0 

with corresponding eigenvectors Vj := (1 ,a>j,u> 2 ,... , where coj \ — 

exp(^). 

A class of graphs which are interesting with respect to arithmetic consists of the 
so-called integral graphs. 
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Definition 2.32. A graph Q is called integral if Spec(t/) C Z. 

In 1974 Frank Harary and Allen Schwenk introduced this concept and published 
a volume of Springer Lecture Notes entitled “Which graphs have integral spectra?” 
[16]. A graph theoretical classification of all integral graphs is still unknown. An 
important subclass of the integral graphs is considered in the following theorem and 
is thus much better understood than integral graphs in general. 

Theorem 2.33 (Wasin So [46], 2005). Every integral circulant graph Q is uniquely 
characterized by a so-called gcd graph gcd(«, T>) = ICG(«, T>) defined as follows: 
n is the order of Q and gcd(n, V) — (Z„, E) is a Cayley graph on Z„, where T> is 
a set of positive divisors of n and E {(a, b) : a, b e Z„, (a — b, n) e T>}. 

Sketch of proof Let Q be any circulant graph. Circulant graphs are obviously regular, 
more precisely they are Cayley graphs on cyclic groups, i.e., w.l.o.g. Q ~ Cay(Z„, S ) 
for the order n of Q and a suitable set S. By Proposition 2.31 we know that each 
eigenvalue A e Spec(C?) is an integral linear combination of ;;-th roots of unity. 
By a famous result of Lam and Leung [26] published in 2000, one has a clear un¬ 
derstanding of when sums of roots of unity vanish, and one could apply this re¬ 
sult here. However, it luckily turns out in our situation that if S is a union of sets 
Sj := {d, 2d,..., (j — 1 )d} for some divisors d \ n satisfying (d, A-) = 1, then 
each eigenvalue A can be expressed as a sum of Ramanujan sums, a special sum 
well known in analytic number theory and, in particular, always with integral val¬ 
ues. Consequently, we have Spec(<5) C Z. Of course, it remains to check that 
Spec (Q) C Z imposes the above mentioned structure on S. It is then not difficult to 
check that S — [J Sd yields the asserted edge set E. □ 

In Section 4 we shall study arithmetical properties of integral circulant graphs. 

Over the last decades the theory of expander graphs has attracted quite a lot of 
interest. These graphs with strong connectivity properties have a lot of applicatory 
consequences, e.g., the resolution of an extremal problem in communication network 
theory (cf. [6]), and they are also of importance in theoretical computer science 
(cf. [19]). A special class of expanders are Ramanujan graphs, which were introduced 
by Lubotzky, Phillips, and Samak [31] in 1988. 

Definition 2.34. A A-regular graph Q = (V, E) is called a Ramanujan graph or 
simply Ramanujan, if A(Q) := max{|A|: A e Spec(C?), |A| < k} satisfies A (Q) < 
2y/k - 1. 


As we shall see in Theorem 3.20 below, these graphs are intimately linked with the 
theory of primes on graphs (see [35]). In Section 4 we shall examine which integral 
circulant graphs are Ramanujan graphs. 
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Exercise 2.35. Which of the cycle graphs C n and the complete graphs K. n are Ra¬ 
manujan? 


3 The prime number theorem for graphs 

We denote by IP the set of prime numbers and, as usual, by 

jt(x) := ^ 1 

p<x, peF 

the function counting the prime numbers up to x in the set of positive integers. There 
is indeed a concept of primality in graphs, and this will be defined for arbitrary finite, 
undirected and connected graphs Q. We introduce a corresponding prime counting 
function ng(x) and relate it to the Ihara zeta function. This reveals analogies to other 
zeta functions, in particular the classical Riemann zeta function with its application 
to jz(x) (cf. [4], [23] or [37]). We shall derive determinant formulae and functional 
equations and consider a Riemann hypothesis, which is related to the Ramanujan 
property of a graph (cf. Def. 2.34). Finally, the Ihara zeta function is used to prove 
a graph-theoretic prime number theorem. Most of the material covered in this section 
is taken from Audrey Terras’ book “Zeta Functions of Graphs - A Stroll through the 
Garden” [48]. 

3.1 Primes in friendly graphs 

Assumption. The graphs Q — ( V, E ) we consider always are finite, undirected, and 
connected. 

Unless otherwise stated, we also assume that Q has no leaves (= vertices of de¬ 
gree 1) and is not a cycle graph with or without leaves (i.e., Q \ {leaves} ^ C„ for 
all n, in other words Q is not unicyclic). 

A graph is called friendly if it satisfies the above assumption. 

Exercise 3.1. Show that every friendly graph with at least one edge contains a cycle 
and, therefore, is multicyclic. 

Let Q = (V, E) he a friendly graph with E — m. We orient the m edges of Q 
arbitrarily to obtain directed edges e\ , ez, ■ . ■, e m (with arbitrary labelling). Then we 
label the inverse edges by setting e m +j := ej 1 , where ej 1 — (u, u) for ej = (u, v). 

The graph Q = ( V, E) with E E U E~ l is called an orientation of Q. 
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Figure 4.8. An orientation of K .4 — e 
Example 3.2 (An orientation of /C '4 — e). 

Definition 3.3. Let Q = (V, E) be a graph, and let Q = ( V, E) be an orientation of 
Q. For m : = \E\ = \\E\, the 2m x 2m matrix Wg = ( Wij ) is defined as follows: 

Set Wij — 1 if e t ej is a directed path (of length 2) in Q, but not e~ l — ej , and 
otherwise Wjj 0. Then Wg is called the edge adjacency matrix of Q. 

Definition 3.4. Let C — e\e 2 .-.e s be a path of edges in an orientation Q of 
a graph Q. 

• C is said to have a backtrack if ej +1 = ej 1 for some j G {1,2,..., s — 1} or 
e s = e\ x . 

• If C is closed but not the power of a shorter cycle, then we call it a primitive 
or prime path if it has no backtracks. 

• Two cycles are said to be equivalent if we get one from the other just by chang¬ 
ing the starting vertex. For C closed, 

[C] := {C, ^ 2^3 • • • s s ei, e-i ... e s e\e2, ■ ■ ■ , e s e\e-2. ■ ■ ■ e s -i} 
is called the equivalence class of C. 

• A prime in Q is an equivalence class [P] of prime paths, i.e., P is a primitive 
path. 

• v[P] := v(P) is called the length of the prime [/*]. 
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Example 3.5. 

(i) Primes in /C 4 — e are (Ex. 3.2 cont’d): 

• [Ci] for Ci := e 2 e 3 e 5 and [ei 0 e%e-]] = [Cj -1 ] with lengths v[Ci] = 

v[Cf 1 ] = 3; 

• [C 2 ] for C 2 := eie 2 e 3 e 4 with length v[C 2 ] = 4; 

• [C”C 3 ] for C 3 := e\ t'iot '4 and each /1 > 0, hence there are infinitely 
many primes in /C 4 — e. 

(ii) In naughty graphs one could also look for primes, but the example at the be¬ 
ginning of this section shows that there are only two primes, namely the two 
orientations of the unique cycle. 

Exercise 3.6. 

(i) Check that equivalence between cycles is in fact an equivalence relation. 

(ii) Let C be a prime path. Show that each closed path equivalent with C is also 
a prime path. 

(iii) Verify that powers of primes are the only non-primes among cycles. 

(iv) If C is a cycle, we can factorize it into subcycles Ci and C 2 if Ci and C 2 have 
a common vertex v and, starting from v, we obtain C by concatenation of Ci 
and C 2 . Give examples of the fact that we do not have unique factorization of 
cycles into primes, e.g., in Example 3.5. 

Definition 3.7. Let Q be a friendly graph, and let Yg be the set of primes in Q. Then 
ng(k) ■= #{[P] e Yg : v[P] = k] 

is called the prime counting function. (Observe the = sign as opposed to < in the 
prime counting function for integers). 

An important parameter will be the greatest common divisor of the prime path 
lengths 

Ag := gcd{v[P] : [P] e Pg} (4.2) 

in a friendly graph Q. 

Proposition 3.8. Let Q be a friendly graph. Then Jtg(k) — 0 if Ag \ k. 

Proof. Since Ag \ v[ P] for each prime [P], there is no prime [ P] with v[P] = k for 
A g\k. □ 

3.2 Ihara’s zeta function The Riemann zeta function encodes information 
about the distribution of primes in the set of positive integers and can be used to 
prove the prime number theorem (cf. [4], [23] or [37]). Other zeta functions have 
been defined and studied in order to obtain information about mathematical objects 
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like prime numbers in arithmetic progressions (Dirichlet L-functions), prime ideals 
in rings of algebraic integers (Dedekind zeta functions), primitive closed geodesics 
in manifolds (Selberg zeta function), and many more (see [24], [27], [33], [38]). We 
shall look at a zeta function related to primes in graphs and will uncover analogies 
with the other zeta functions. 

The Ihara zeta function was first defined by Yasutaka Ihara [21] in the 1960s in the 
context of discrete subgroups of the two-by-two /j-adic special linear group. Jean- 
Pierre Serre suggested 1977 in his book “Arbres, Amalgames, SL 2 ” (= “Trees" [43]) 
that Ihara’s original definition could be reinterpreted graph-theoretically, namely as 
a zeta function related to closed geodesics in graphs (compare with the Selberg zeta 
function). In 1985 Toshikazu Sunada [47] put this suggestion into practice. 

Definition 3.9. Let Q be a friendly graph. The Ihara zeta function is the complex 
function 

&(«<) = ?(*.£): = n (l-w WP1 )"\ (4.3) 

[ifielPg 

where u e C is supposed to have sufficiently small absolute value. Recall that we 
distinguish between the primes [P] and [P -1 ]. The question of convergence will be 
postponed until after Proposition 3.11. 

Example 3.10. Although the cycle graph C n is not friendly (with only two primes = 
cycle in two orientations), we still can evaluate its Ihara zeta function: 

( OO 

I 

k =0 

Let us explain at the outset why the Ihara zeta function of a friendly graph Q is 
useful in dealing with primes in Q. To this end, let N m (Q ) be the number of cycles of 
length m without backtracks in Q . 

Proposition 3.11. For all w e C with sufficiently small absolute value \u\ it holds 
that 

logfc(»)=E^)»". (4.4) 

L —' m 

m— 1 

Proof. Taking logarithms in (4.3) we obtain 

logCg(M) = log Y\ 1o § ( 1 - u v[p] ^j 

[P]ePg [T]6Pg 

°o 1 00 1 

= y vV j, i= y — rV p ) 

/—I Z—l j /—! v (P) t— 1 j 

[TJelPg j = \ P primitive v j = l 

= y y-L-u v ( pJ \ 

V(PJ) 

P primitive J = 1 
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since there are exactly v(P) elements in a prime [P] and trivially v ( P 7 ) = jv(P). 
As seen in Exercise 3.6 (iii), any cycle without backtrack in Q is a power of some 
primitive cycle. Hence 





C cycle w/o backtr. 



□ 


m 


m — \ 


Now let us verify that ^g(u) does indeed converge inside some circle in the com¬ 
plex plane with positive radius centered at 0. By Proposition 2.24, (A™ )jj equals the 
number of directed paths of length m from V j e V to Vj , which in turn is the number 
of directed cycles of length m containing Vj . Hence ( A™)jj is greater than or equal 
to the number of cycles of length m containing vj. Therefore, setting n \ V for 
the number of vertices in Q, we have 


n 


E (Wjj ^ n "W- 


Since Ag has only non-negative entries 0 and 1, it is easy to see that 


(Ag)ij < (1 m )ij, 


where 1 denotes the n x /;-matrix with all entries 1. Obviously, 1 m — n m 1 1 ] j hence 


n 


N m (G) < E 



j= 


For \u\ < the sum 



converges absolutely. By Proposition 3.11, this implies that log £ g(u ) is well defined 


for | m | < j-, and thus the same is true for ^g(u) itself. 


□ 


We introduce the notion of the radius of convergence Rg such that (,g(u ) con¬ 
verges inside \u\ < Rg and has a singularity on the border \u\ — Rg, called circle of 
convergence. 

One might like to compare formula (4.4) with related identities for the classi¬ 
cal Riemann zeta function and the von Mangoldt function or other characteristic 
functions for primes in Z (cf. [4] or [23]). 

We are lucky to have explicit formulae which allow us to compute %g(u). One of 
them is called the two-term determinant formula (Theorem 3.31) and relates %g(u) to 
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the determinant of the edge adjacency matrix Wg (cf. Definition 3.3). This formula 
can be used to obtain the so-called lham three-term determinant formula (Theorem 
3.15). 

In order to formulate the latter result, we should consider the fundamental group 
of a graph and its rank. The fundamental group, in general associated to any given 
pointed topological space, provides a way to determine when two paths, starting and 
ending at a fixed base point, hence called loops, can be continuously deformed into 
each other. Such loops are considered equivalent. Two loops can be combined by just 
wandering around the first one and afterwards around the second one. The elements 
of the fundamental group are the equivalence classes of loops and the operation is the 
combination of these. 

One of the simplest examples is the fundamental group of the circle, which is 
obviously isomorphic to the group (Z, +). This corresponds to the fundamental group 
of any of the cycle graphs C n (with or without leaves). In general, it turns out that the 
fundamental group T {Q, v i ) of a graph Q — {V, E) relative to the base vertex iq e V 
is a free group on r generators, where r is the rank of the group. T(Q, v i) and, in 
particular, its rank r can easily be determined by the following 

Algorithm 3.12. Let Q — (V, E) be a graph (possibly directed with multi-edges and 
loops). 

(1) Choose any orientation of Q to obtain Q. 

(2) Choose any spanning tree T = (V, E') of Q (which exists by Proposition 2.9). 

(3) For each edge iq v j e E \ E' , construct a path from v 1 to iq and from vj back 
to v | (all these paths are unique by Exercise 2.8) to obtain a cycle starting from 
tq via Vi and vj back to iq. 

Proposition 3.13. Let Q = (E, E) be a graph with tq e V. 

(i) The equivalence classes of the cycles obtained in (3) of Algorithm 3A2form 
a free basis ofT(Q , tq). 

(ii) The rank r of T(Q ,tq) satisfies r = \E\ — |E| + 1. 

Proof The fundamental group V(Q, tq) consists of sequences of cycles (more pre¬ 
cisely, equivalence classes of cycles) starting from iq. The spanning tree constructed 
in step (2) of Algorithm 3.12 does not have any cycles. However, each edge in E\E' 
completes a unique cycle, and different edges in E \ E' yield inequivalent cycles. 
This proves (i). 

By (i) we know that r = \E\— \E'\, where E' is the edge set of a spanning tree 
of Q. By Exercise 2.8, a tree on | V vertices has | V\ — 1 edges. This proves (ii). □ 

Example 3.14. We determine a free basis and the rank of the fundamental group of 
the graph Q below: 


142 


Jurgen Sander 



c 



Spanning tree + orientation of Q 



bed 


( 7 ): ghfe~ l d 

(T): ih^g - 1 

( 7 ): 9h] a 


Fundamental group: 

r(^,ui) generated by [bed], [ ghfe~ 1 d ], [ih~ 1 g~ 1 ] and [ghja] 

Rank r of T(Q, Ui): 
r = 4 = 10-7+1 = |£| - |R| + 1 


Figure 4.9. How to determine the fundamental group of a graph 
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(1) We first choose a vertex iq, any orientation of Q, and any spanning tree T of 

g. 

(2) For each directed edge e in Q not belonging to T, we construct the unique 
directed cycle in Q starting from tq through e. 

(3) These cycles are a free basis of V(G, v i) and their number is the rank of the 
fundamental group. 

Now we are able to formulate the three-term determinant formula, first proved by 
Y. Ihara [21] in 1966 and later generalized by K. Hashimoto [18] in 1989 and H. Bass 
[5] in 1992. 


Theorem 3.15 (Ihara’s three-term determinant formula). LetQ = (F, E) be a friendly 
graph with V — {tq,..., u„}. Let Qg — (r/q) be the n x n diagonal matrix with 
diagonal entries qjj — d(vj) — 1, where d(vj) is the degree of Vj. Then 

tg(u) -1 = (1 — u 2 ) r ~ 1 det (7 — Agu + Qgu 2 ), 

where I is the unit matrix and r = | E | — | F | + 1 is the rank of the fundamental group 

of G- 


Before proving Theorem 3.15 in Section 3.3, let us deduce a variety of conse¬ 
quences of the three-term determinant formula. 


Example 3.16. (cf. Fig. 4.10 and 4.11) Consider the complete graph IC 4 with r = 
6-4+1 =3, 

/0 1 1 K 
, 10 11 

A IC 4 - J 101 

Vi 1 1 0/ 

and Q/c 4 = 27, because /C 4 is 3-regular. The three-term determinant formula yields 


£jc 4 (m) 1 


(2 u 2 + 1 

—u 

—u 

—u 

\ 

—u 

2 u 2 + 1 

—u 

—u 


—u 

—u 

2 u 2 + 1 

—u 


y —U 

—u 

—u 

2 u 2 + 

V 


= (1 — w 2 ) 2 (l - w)(l — 2 m)(1 + u + 2 u 2 ) 3 . 


Flence the five poles of £/e 4 (n) with different multiplicities are located as follows: 
three real poles at ±1 and and two complex poles at |(—1 ± i \fl) (with absolute 

value ^). Since the pole closest to the origin lies at we have A+ 4 = ^ as radius 
of convergence. 
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Figure 4.10. Ihara’s zeta function for 1C 4 in the invariable: z = |£jc 4 (v + iy)\ . This figure is 
taken from [48] with permission of the author. 



13 


Figure4.11. Ihara’s zeta function for IC 4 in the j-variable: z = | (2 1. This figure is 

taken from [48] with permission of the author. 


Exercise 3.17. Show that the irregular graph K.4 — e (cf. Example 3.2) satisfies 

^/C 4 -e(u)~ L = (1 — u 2 )(l - u)( 1 + u 2 )(l + u + 2u 2 )(l — u 2 - 2 m 3 ) 

with nine roots: ±1, ±i, ^(— I ± i sfl) and three cubic roots m, M 2 . M 3 , say, satis¬ 
fying |sj| sa 0.657 and |5 , 2[ = [ 53 ! ~ 0.872. Hence the radius of convergence is 
R)C4-e ~ 0.657. 

We shall now specialize on regular graphs. For this graph class a lot of the con¬ 
cepts we are interested in are partly easier to understand, but most results can be 
generalized to irregular graphs. 

Reconsider the regular graph IC 4 (cf. Example 3.16) with 

fcM - 1 — (1 - m 2 ) 2 (1 - m )(1 - 2 m )(1 + m + 2 m 2 ) 3 . 

We take a look at z = |(7q(a' + iy)\~ l (Fig. 4.10), depicting the five zeros (not 
counting multiplicities). 
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Figure 4.12. Riemann’s zeta function z = |£(* + iy)|. This figure is taken from [48] with 
permission of the author. 


To catch another (logarithmic) sight of z = [£jc 4 (m)| 1 , we make the change 
of complex variable u ^ and display the landscape of z = \^,K^^ x+iy ^)V l 
(Fig. 4.11). 

This bears some resemblance to the landscape z = |£(x + iy)\ of Riemann’s zeta 
function £(s) (Fig. 4.12). 

In fact, the resemblance between the zeta functions of Riemann and Ihara is strik¬ 
ing. The classical Riemann hypothesis (see e.g. [23] or [37]) conjectures the loca¬ 
tion of the non-trivial zeros of the Riemann zeta function. 

Definition 3.18. Let Q be a friendly (q + l)-regular graph and consider the Ihara zeta 
function t,g(q~ s ) as a complex function in s e C. We say that ^g(q~ s ) satisfies the 
Riemann hypothesis if ljg(q~ s )~ l can vanish in the critical strip 0 < Re s < 1 only 
for Re.v = i.e., for \u\ = |g -,s | = -^=. 

Exercise 3.19. Check that satisfies the Riemann hypothesis (cf. Example 

3.16). 

Theorem 3.20. Let Q be a friendly (q + 1 )-regular graph. Then %g(q~ s ) satisfies the 
Riemann hypothesis if and only ifQ is a Ramanujan graph (cf. Definition 2.34). 

Proof By the determinant formula in Theorem 3.15, we have 

^“T 1 = (1 - q~ 2s Y~ l det(7 - Agq~ s + Qgq~ 2s ). (4.5) 

Since Ag is a real symmetric matrix, it is diagonalizable (even by means of orthonor¬ 
mal matrices). Hence there is an invertible matrix T such that T~ 1 AgT — Lg, where 
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Lg is a diagonal matrix with the eigenvalues of Q on the diagonal. Since Q is assumed 
to be (q + 1)-regular, we have Qg = ql. Altogether 

det(7 - Agq~ s + Qgq~ 2s ) = det ( TIT~ 1 - T(q- s Lg)T~ l + T(q l ~ 2s I)T~ l ) 

= det T ■ det — q~ s Lg + q l ~ 2s I^j ■ det T~ l 
= det — q~ s Lg + q l ~ 2s I^j 

= 11 (1 -Xq-'+q 1 - 2 ’), 

AeSpec (Q) 

hence 

tgiq-’T 1 = (1 - q- 2 *)'- 1 Hi 1 ” ^ + 0 1 " 2 ')- 

AeSpec(^) 

Fix some A e Spec(^) and write 

1 — A q~ s + q l ~ 2s = (1 — aiq~ s )( 1 — d 2 q~ s ), 

i.e., oq and d 2 are reciprocals of poles of £ g(q ~ s ) satisfying oi\ + a 2 — A and 
d\d 2 = q. This implies the quadratic equation d 2 — Acq = —q, leading to 

d\ = -(A + \]\ 2 - \q ) and d 2 — -(A — sjX 2 — \q ), (4.6) 

or vice versa. 

By Proposition 2.26 we know that q + 1 e Spec(CJ) and |A| < q + 1 for all 
A e Spec((/). We distiguish three cases, namely A = ±(q + 1), |A| < 2 Jq, and 
2 Jq < |A| < q + 1. 

Case 1: A = ±(q + 1). 

By (4.6), we immediately obtain di — ±q and d 2 — ± 1, or vice versa. 
Case 2: |A| < 2Jq. 

It follows that A 2 — 4 q < 0, thus d\,d 2 e C and ^A 2 — 4 q = ±i ■ 
J \q — A 2 . This implies that 

|ati| = ^|A±i • \/— A 2 1 = ^-y/A 2 + (4 q - A 2 ) = Jq , 

and similarly \d 2 \ = Jq. 

Case 3: 2Jq < |A| < q + 1. 

Now d \, d 2 e K. Observing that A + sj'k 1 — 4q is increasing in A, we obtain 
by (4.6) for positive A, i.e., for 2 Jq < A < q + 1, that Jq <d\ < q. Since 
d i d 2 = q, this implies 1 < d 2 < Jq. Checking the vice versas and A < 0, 
we conclude that 1 < |a, | < q, but |otj| ^ Jq for i = 1,2. 
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Figure 4.13. Possible locations for poles of %g(u) for regular Q. This figure is taken from [48] with 
permission of the author. 


To complete the proof we just have to remember that the poles of t,g(q~ s ), other 
than those coming from the first factor in (4.5) and having 91 s = 0, are the reciprocals 
of some oil > i.e., U-i — q s and \ai\ = = q^ s . The eigenvalues ±(q + 1) e 

Spec (Q) are maximal in absolute value and therefore irrelevant for the Ramanujan 
property. At the same time, they do not affect the Riemann hypothesis, because they 
correspond to poles having 315 = 1 or 315 = 0 (see Case 1). The eigenvalues 
considered in Case 2 satisfy the Ramanujan condition and the corresponding a t all 
have 315 = ^, satisfying the Riemann hypothesis, as shown in Case 2. If there existed 
an eigenvalue A e Spcc(C) belonging to Case 3, it would violate the Ramanujan 
condition. At the same time, Case 3 has revealed the existence of a pole with 0 < 
915 < 1, but 915 y 1, thus contradicting the Riemann hypothesis. □ 

The preceding proof immediately implies 

Corollary 3.21. Let Q be a friendly (q + l)-regular graph. 

(i) The figure above shows the possible locations for the poles of C,g(u). Poles 
satisfying the Riemann hypothesis are those on the circle. 

(ii) The radius of convergence satisfies Rg = K 

Setting again u = q~ s , there are several functional equations for the Ihara zeta 
function on a regular graph, relating the value at 5 to that at 1 — 5 (corresponding to 
u <r¥- just as in the case of the Riemann zeta function. 

Theorem 3.22 (Functional equations for Ihara zeta functions). Let Q be friendly 
(q + l)-regular graph of order n and rank r of its fundamental group. Then, among 
other similar identities, we have 

(i) $\u) := (1 - u 2 r~ l + 'i (1 - q 2 u 2 ) n Hg(u) = (-1)"$ } (£); 

(ii) 4 2 V) := (1 + uy~\ 1 - uy~ 1+n ( 1 - qung(u) = 

(iii) £g 3 V) : = 0 - u 2 ) r ~\ 1 + qu)' l ^g(u) = 
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Proof of (i): The determinant formula in Theorem 3.15 and the transformation u i-> 

qu lnl P'y 

Ijg \u ) = (1 — m 2 )2 (1 — q 2 u 2 ) 2 det(7 — Agu + qu 2 7) _1 



Exercise 3.23. 

(i) Check (ii) and (iii) in Theorem 3.22. 

(ii) Show that has a pole at ^ if it has one at u. Verify that is the complex 
conjugate of u if u lies on the circle of radius and that ^ lies in the interval 

(»■ W )fOTMG( W’ 1) - 


Most of the results in this section can be generalized to irregular graphs. All we 
shall present here is a theorem showing the possible location of the poles of for 
irregular Q. 


Theorem 3.24 (Motoko Kotani and Toshikazu Sunada [25], 2000). Let Q = ( V , E ) 
be a friendly graph with p + 1 := min{ri(u) : v e V} and q + 1 := max{t/(u) : 
v E V}, and let Rg be the radius of convergence offg. 

(i) We have ^ < Rg < and every pole u oft,g{u) satisfies Rg < \u\ < 1. 

(ii) Every non-real pole u e C of ^g(u) lies in the ring < |m| < 

(iii) The poles u of^g(u) lying on the circle |m| = Rg have the form 

2itik 

u = R g e so (k = 1,2,..., Ag), (4.7) 

where Ag = gcdlufT^] : [ P] e Fg}, as defined in (4.2). 


The most interesting part of the preceding theorem is the characterization in (iii). 
It is a rather straightforward consequence of an advanced result in linear algebra 
called the Perron-Frobenius theorem (cf. [20]), which essentially states: 


Let M be an n x n matrix with non-negative real entries and irreducible, 
i.e. (7 + M ) n ~ 1 has only positive entries. Then the eigenvalues e 

Spec (M) (k = 1. K ), say, of maximal absolute value are given by 

Afc = maxSpec(M) • e 2 ^. 


Applying this to the edge adjacency matrix M := Wf, proves (iii). 
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Example 3.25. The irregular graph Q on the left below is obtained by adding four 
vertices to each edge of the complete graph /C 5 . Clearly, p + 1=2 and q + 1=4. 



On the right, all poles u ±1 of C,g (u) are depicted (observe 5-fold rotational 
symmetry). The circles centered at 0 have inverse radii -^= = ^ ss 0.577, Rg = 

ss 0.896, and -^ = 1, coming from £g(n) _1 = (w 5 ) = (1 — n 10 ) 5 

(1 — 3m 5 )(1 — it 5 )(1 + u 5 + 3m 10 ). (The figures are taken from [48] with permission 
of the author.) 

Exercise 3.26. Prove that in our case K = Ag (cf. [48], p. 95). 

We shall see in Exercise 3.34 how to produce pictures like Figure 4.14. 


3.3 The determinant formulae for Ihara’s zeta function A proof of 
Ihara’s three-term determinant formula has still to be given. We shall proceed by 
first showing the two-term determinant formula (Theorem 3.31) mentioned above 
and derive Theorem 3.15 from it. First we have to introduce the exponential function 
and the logarithmic function for square matrices (as one usually does in the theory of 
Fie groups). 


Definition 3.27. For an n x n matrix X over K or C, the matrix 


exp X 



is called the matrix exponential of X. 

Proposition 3.28. Let X,Y be n x n matrices over K or C. 

(i) The series defining exp X converges absolutely. 

(ii) For the zero matrix O we have exp 0 = 1. 
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Figure 4.14. Locations of poles of for an irregular random graph Q with 800 vertices: 

- violet circle of radius s fp, blue circle of radius Jq ; 

- green (Riemann hypothesis) circle of radius . 

RH says spectrum should lie inside green circle - almost true! 

This figure is taken from [48] with permission of the author. 


(iii) If X = ( Xij) is diagonal, then exp X = {xij) with xa = e Xii and Xij = Ofor 

i j ■ 

(iv) If XY = YX, then (exp JO (exp Y ) = (exp K)(exp X) = exp( X + y). 

(v) If Y is invertible, then exp(7 _1 XY) = Y^ 1 (exp X) Y. 

(vi) det(expZ) = e tiX . 

Proof of (vi). For any X there is an invertible matrix Y such that YXY~ l is an upper 
triangular matrix T, say (see any book on linear algebra). Hence 

det(expX) = det(exp(F _1 2 n y)) = det(y _1 • (expT 1 ) • Y) 

= det(y _1 ) • det(exp T) ■ det Y = det(exp T). 

Since T = (t t j) is upper triangular, so is T k = (t^) for all k. In particular, — 

t k for all i and k. This implies, by the definition of the matrix exponential, that 
exp T — (tij) is upper diagonal with ta = e tu for all i. Hence 

det (exp T) = e tn ■ ■ ■ e‘ nn = e" T = e trX , 

since tr T = tr(y(Jfy -1 )) = trftir^y) = trX. □ 

Exercise 3.29. Prove Proposition 3.28 (i)-(v). 

Definition 3.30. If the n x n matrices X and Y over K or C satisfy exp Y = X , then 
y = log X is called a matrix logarithm of X. 
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log X does not exist for every matrix X, and if it exists it is usually not unique. 
For X = ( Xij ), 

n n 

ll^llFrob := EEM 2 
\ i = l7 = 1 

is called the Frobenius norm or Schur norm of X. If | X || Frob < 1, then 


°o 

log (I-X) = -J2^ k 
k = 1 K 


(4.8) 


is a matrix logarithm of X. 

Theorem 3.31 (Two-term determinant formula). Let Q be a friendly graph and let Q 
be an orientation ofQ with edge adjacency matrix Wg. Then 

£g(u) _1 = det(7 — Wgu). 

Proof. Let Q — ( V, E) and m : = £j. Recall that N m (Q) is the number of back- 
trackless cycles of length m in Q. We first claim that 

N m {Q) = trlkf. (4.9) 

y 

To prove this, let Wg — (w,j ) and W~ = for all positive k. By induction, 

^ 7 ° = E E ''' E w Ui w hl2 ■ ■ ■ 


tr w™ = e = E E E • ■ ■ E wu i w ^~ ■ ■ ■ wi >n-p ■ 

i i i\ £2 ^m—1 

Since we have wu 1 wi 1 e 2 ... wi m _p = 1 if and only if is a closed 

path of length m starting with edge e ( , tr Wl" apparently counts all these paths, and 

Q 

just the ones without backtracks due to the definition of wp. This proves (4.9). 

By virtue of identities (4.4) in Proposition 3.11 and (4.9), we have for sufficiently 
small \u\ 


oo 

log Sg(u) = E 

m= 1 


N m {G) 

- U 

m 



(4.10) 


where the last identity follows from the fact that tr is continous and linear. 
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If \u\ is sufficiently small, we clearly have | W^u ||Frob < 1, and consequently 

OO J 

J2-(W d u) m = -\og(I-W d u) 
m y y 

ra= 1 

by (4.8). This and (4.10) imply 

Kg( u)~ l = e tr ( log(/ W s ll) ) = det (exp (log(7 — W^u))) = det(7 — W^u), (4.11) 
thanks to Proposition 3.28 (vi). □ 

Corollary 3.32. For a friendly graph Q with orientation Q and edge adjacency matrix 
Wg, we have 

&(w) _1 = f~[ (1 — Am). 

AeSpec(J^) 

In particular, the poles ofKg(u) are the reciprocals of the eigenvalues of Wg. 

Proof det(z7 — IT -) is the monic characteristic polynomial of IT -, i.e., 
det(z7 - W § ) = n (- A >- 

AeSpec(T^) 

For the number m of edges in Q, we consequently have 

det(7 — Wgu) = det — 7 — 1T^j j = u 2m ]~~[ ^-A^ 

AeSpec(W^) 

= Yl (1 — Am). □ 

AeSpec(T^) 

The last ingredients we need to verify Theorem 3.15 are a few identities for the 
following block matrices related to an orientation Q of a graph Q — ( V, E) with 
n \ V\ and m : = \E\: 

• The 2m x 2m matrix 

r _ / O m I m \ 

\Im O m ) ’ 

where O m denotes the m x m zero matrix and I m is the m x m unit matrix; 

• the n x 2m start matrix Sg = (stj), derived by 

! 1, if Vi is the starting vertex of ej in Q, 

0, otherwise; 
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• the n x 2m terminal matrix Tg — ( tjj ) defined by 

1, if Vi is the terminal vertex of ej in Q, 

0, otherwise. 

Note that, upon numbering directed edges in our orientation Q of Q by the rule 
ej+ m ej 1 , we have S — (M\N) and T = (N\M) for two suitable n x m 
matrices M and N. 

Proposition 3.33. Let Q — ( V, E ) be a graph with n \V\ and m : = \E\, and 
let Q be an orientation of Q. Let J, Sg, and Tg be defined as above, Ag be the 

adjacency matrix of Q, W'j, be the edge adjacency matrix of Q, and Qg be as defined 
in Theorem 3.15. Then 

(i) S^J = Tg and T^J = Sg; 

(ii) Ag = S § Tl and Qg + /„ = SgSl = TgTl; 

(lli) w d + j = ns § . 

Partial proof The proofs of (i) and the second identity in (ii) are left as exercises. By 
the definition of matrix multiplication, 



2m 

(W* = (4I2) 

7 = 1 

where sg- tkj = 1 if and only if the edge e t has starting vertex v, and terminal vertex 
v/c, and Sjjt/cj — 0 otherwise. Hence the right-hand side of (4.12) equals the number 
of oriented edges from u, to Vk, and precisely this is the entry /1 (/ -. This proves the 
first identity in (ii). 

Similarly, we have 

n 

( T § s g)ik = XI 

7 = 1 

where tjiSjk = 1 if and only if the edge a feeds via the vertex vj directly into the 
edge ek (and this is even true in case ek = ef ] ), and t jiSjk — 0 otherwise. Hence 

E j 1. if et feeds directly into e k , ,, n , 

f J iS J k = 0, otherwise. (4 ' 13) 

7 = 1 V 

By definition, this equals the entry Wik of Wg for c J k f- ef 1 . Recalling our labeling 
convention ej +m := e~ l in Q, we sec that the right-hand side of (4. 1 3) equals Wjk + 1 
in case ek — ef 1 . Putting everything together, (iii) follows. □ 
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Exercise 3.34. 

(!) Prove (i) and the relation Qg + /„ = SgSt = 7-7’l from Proposition 3.33. 

(ii) Show that Ag = SgTl even holds if Q is allowed to have loops. 

(iii) Use Prop. 3.33 to create random graphs (see figure at the end of Section 3.2): 

Take random permutation matrices P\ .and define M P\ . P^. 

Similarly, build TV, and then define Sg := (M|7V) and Tg : = ( N\M ). Finally, 
obtain Wg by Proposition 3.33 (iii). 


Proof of the three-term determinant formula — Theorem 3.15. Let n = \V\ and 777 = 
|£|, and let A := Ag, Q := Qg, W := Wg, S := Sg and T := Tg. All four-block 
matrices in the following calculations are of size {n + 2m) x (n + 2m) with blocks 
of size n x n in the upper left corner. Then 


and 


M l := 


(I n 0\ (I„(l-U 2 ) SU \ 
\T‘ I 2 m)\ o hm-Wu) 

(I n (l-U 2 ) Su \ 

{rfl-u 2 ) T‘Su + hm-Wu) 


(I n — Au + Qu 2 Su \ ( I n O ) 

^ O I 2m + Ju)'\T t -S t u I 2m ) 

(l n — Au + Qu 2 + ST‘u — SS‘u 2 Su \ 
y {I 2 m + J iQiT 1 — S‘u) I 2m + J u J 


We claim that Mi = M 2 . For the upper left corner of M 2 we have, by Proposition 

3.33, 


I n - Au + Qu 2 + ST t u - SS‘u 2 = I n - ST‘u + ( SS‘ - I„)u 2 + ST l u - SS‘u 2 


Similarly, the lower left corner of M 2 equals 

(I 2m + Ju)(T* - S‘u) = T l - S’u + JT‘u - JS r u 2 

= T‘ - S‘u + (!TJfu - ( SJ)‘u 2 

= T r - S‘u + S f u - T‘u 2 = 

= r ( (l-n 2 ), 


and finally the lower right corner of M\ is 

T'Su + I 2m - Wu ( = PSu + I 2m — (T‘S — J)u 


— I2m + J u . 
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which proves our claim. Consequently, det M\ — det M 2 with 

detMi = det(/„(l — u 1 )) det{I 2m — Wu) 

= (1 - u 2 ) n det {1 2m - Wu) = (1 - m 2 )"^ 6 (m)~ 1 

by the two-determinant formula Theorem 3.31. On the other hand, 

det M 2 = det(/„ — Au + Qu 2 ) det(I 2m + Ju), 

which implies 

te(w) _1 = (1 — u 2 )~ n detMi = (1 — u 2 )~ n det M 2 

= (1 — u 2 )~ n det(I n — Au + Qu 2 ) det(/ 2m + Ju). 

Observe that 



hence 



By (4.14), this yields 

£g(w) -1 = (1 — u 2 r~ n det(/„ — Au + Qu 2 ) = (1 — u 2 ) r ~ l det(/„ — Au + Qu 2 ) 


for connected graphs, by Proposition 3.13 (ii). 


□ 


3.4 The prime number theorem for graphs For a friendly graph Q, we de¬ 
fined Fg as the set of primes in Q, the greatest common divisor Ag = gcd{v[P] : 
[P] e Fg } and the prime counting function 


ng(k) := #{[P] e Fg : v[P] = k}. 


Recall that Rg denotes the radius of convergence of (g and is the reciprocal of the 
modulus of the largest eigenvalue of (cf. Corollary 3.32), while the second largest 
modulus of the eigenvalues is A{W^) max{| A |: A e Spec(lF^), | A |< Rg 1 } 
(cf. Definiton 2.34). 

Theorem 3.35. Let Q = (V. E) be a friendly graph , and let k be a positive integer. 
If Ag \ k, then 7tg(k) = 0. For Ag \ k, we have 



(4.15) 
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Proof. This proof somewhat imitates the proof of the analogous result for zeta func¬ 
tions of function fields introduced by Emil Artin — where the Riemann hypothesis 
is known to be true by works of Helmut Hasse, Andre Weil and Pierre Deligne (cf. 
[38]). 

By Proposition 3.8, the case Ag \ k has already been treated, and we may assume 
that A g | k. By our definitions. 


&(w) = n 

fP]ePg 


—Tig (k ) 


k = 1 


This implies that 

OO 

log Zg(u) = ~^2,ng{k) log(l - u k ) 


k =1 

OO 


OO Lf OO 

M kt=r 


= J2 n s( k )J2~f = m !><?(*)•-’ 


k = 1 


1 =1 


m = 1 felm 


and consequently 

d 


u 


— log£g(w) = m ^ 1 ’Y^xg(k) ■ — — ^ u m ^A;7 Tg(k). (4.16) 

u M TYl 


m =1 /:|m m=l &|ra 

On the other hand, we know from (4.4) in Proposition 3.11 that 


U5 -lo8f eM = U -(^ 


(Vm(S) 


m=l 


m 


oo 

’) = E N ^ 1 


m—1 


which combined with (4.16) yields 


Nm(G) = ^2kng{k ). 

/:1m 


(4.17) 


(4.18) 


Using Mobius’ inversion formula, we obtain that 

”q^) = \Y.^) n ^ = \{ n ^+ E (4 - 19) 


rf|A: 


d|jfc,d<| 


Applying the two-term determinant formula in terms of Corollary 3.32, we deduce 
a third identity for u ^ log (g (u ), namely 

u ~chi IOg = U 7u IOg ( FI C 1 --*“)) 


AeSpec(W^) 


d 


Cl —■v 

= —W — > log(l — Xu) 

du L —' 


AeSpecCPF^) 
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d 



OO 


E E 

AeSpec(JTg) m= 1 


( Xu) m 
m 


OO 

= E E( A »'" 

AeSpec(tT^) m = 1 

OO 

= E ( E A "')“” 

m—1 A€Spec(T^) 


By comparing with (4.17), we see that 

N m (G) = £ AT 

AeSpec (Wg) 


(4.20) 


For large m the dominant terms in the last sum are the ones coming from A e 
Spec(JT^) with maximal absolute value, i.e., from the poles u of ^g(u) with minimal 
absolute value, as follows from Corollory 3.32. By Theorem 3.24 (i), these A have 
absolute value |A| = Rg l . By Theorem 3.24 (iii), we even know them exactly and 
they have the form 


2nik 

A = R~ l e ^ (k = l,...,A g ). 

Hence, by (4.20), 

A g 

,—. 2 nikm ,—. 

N m (g) = J2 R g me Ag + E A ™ 

k =1 AeSpec(tT^), \X\<A(Wg) 


For the main term in (4.21) we obtain 




2j xikm 

A Q 


k = 1 


R^ 


Kg 

E' 

k= 1 


Rg m Ag, if Ag | m, 
0, if Ag \ m. 


(4.21) 


(4.22) 


and the error term can be bounded as 


E 


A m 


AeSpec(Wg), |A|<A(Wg) 


< |Spec(W^)|A(0r =2\E\k(W § y n . 


(4.23) 


Combining (4.21), (4.22), and (4.23), we obtain in case Ag | k, as we may assume, 
that 


Nk(G) = Rg k Ag + 2\E\K{W § ) k = Rg k Ag + 0(A(W d ) k ), 


(4.24) 








158 


Jurgen Sander 


and for all d 


\N d m < Rg d Ag +2\E\HW § y = 0(Rg a ) 


-d\ 


(4.25) 


since Rg j > A ( Wg ) by definition. Inserting (4.24) and (4.25) into (4.19) yields 

ng(k) = ^(Rg k Ag + 0(A(W d ) k ) + 0( £ Rg d )) 


1 <d<\ 


= A g- 


R 


—k 


,A{W 5 )K (K a -s 


R, 


□ 


Example 3.36. Reconsider the graph /C 4 — e (cf. Example 3.2 and Exercise 3.17) with 

fa-eiu)- 1 — (1 — n 2 )(l — m)( 1 + w 2 )(l + u + 2n 2 )(l — u 2 — 2 u 3 ). 

Using (4.17), we obtain after a little computation 

00 d 

^ N m (Q)u m = u — log $g(u) 

m—l 

= 12 u 3 + 8n 4 + 24 u 6 + 28 u 1 + 8 u s + 48 u 9 + 120m 10 + 

+ 44m 11 + 104m 12 + 416t/ 13 + 280n 14 + <9(w 15 ). 


Now we apply the formula N m — N m (JC 4 — e) = J2k\m ^ n K. A -e (k) in (4.18) to 
compute small values of n(k) := 7t)c 4 - e (k): 


U = N 3 = 1- tt(1) + 3 • tt(3) = 3jt(3) 

8 = (V 4 = jt (l) + 27r(2) + 47t(4) = 47r(4) 

0 = N5 = 7r(l) + 5tt(5) = 5n(5) 

24 = N 6 = 7 T (1) + 2tt(2) + 3tz(3) + 6 rr( 6 ) = 12 + 6n(6) 
28 = N 7 = 7r( 1) + = 7 7T (7) 

8 = N&= 7r(l) + 2n{2) = 4tt( 4) + 8rr(8) = 8 + 8rr(8) 

48 = N 9 = n(l) + 3tt( 3) + 9;r(9) = 12 + 9rr(9) 

120 = N 10 = tt( 1) + 2jt(2) + 5tt(5) + IOjt(IO) = 10jr(10) 


=> ti( 3) = 4; 
=► Jf(4) = 2; 
=>■ 7t( 5) = 0; 
=> 77(6) = 2; 
=> Jr(7) = 4; 
=► 3r(8) = 0; 
=>■ tt(9) = 4; 
=> 7T(10) = 12. 


Exercise 3.37. 

(i) Identify the different primes in /C 4 — e according to Example 3.36. 

(ii) Use Example 3.16 to find 7Tjc 4 (k) for k = 3, 4, 5,..., 11. 


3.5 Some remarks on Ramanujan graphs At the end of Section 2.2 we in¬ 
troduced Ramanujan graphs as special expander graphs having applications in com¬ 
munication network theory (cf. [ 6 ]), and theoretical computer science (cf. [19]) in 
general. In this section we emphasize their importance regarding the Riemann hy¬ 
pothesis for graphs, i.e., the distribution of primes in graphs. 
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The asymptotic formula (4.15) has a “good” error term if the second largest 
eigenvalue of the edge adjacency matrix of the graph Q is “small”. For (q + 1)- 
regular graphs, in particular, this is true when the graph is a Ramanujan graph with 
A(Q) < 2 Jq, and this in turn is equivalent with the fact that the corresponding 
Riemann hypothesis is satisfied (see Theorem 3.20). The following theorem in fact 
shows that in this case the error term will be best possible if we consider a family of 
(q + l)-regular graphs whose vertex numbers go to infinity. 

Theorem 3.38 (Alon [2] and Boppana [8], 1986-87). Let (G„)neN be a sequence of 
{q + 1) -regulargraphs G n — ( V n , E n ) with lim„^oo \ V n \ — oo. Then 

lim inf A (Q m ) > 2 Jq. 

n—>oo m>n 


In 1988 Lubotzky, Phillips, and Samak [31] constructed such an infinite family 
of (q + l)-regular Ramanujan graphs for q = 1 mod 4 prime. Their proof uses the 
Ramanujan conjecture, which led to the name of Ramanujan graphs. Morgenstern 
[34] extended the construction of Lubotzky, Phillips, and Sarnak to all prime powers 
in 1994. The problem is open for general q. 

In Section 4.2 we shall take up the topic of Ramanujan graphs once again. 


4 Spectral properties of integral circulant graphs 

A graph is called integral if its spectrum is integral (cf. Definition 2.32). Up to now 
no handy criterion for this property is known (cf. [16]). Yet many classes of integral 
graphs have been identified and studied recently, in particular in the case when the 
graph is circulant (cf. Definition 2.28), meaning its adjacency matrix is circulant (see 
e.g. [1], [3], [22], [42], [45]). The class of integral circulant graphs displays rich 
algebraic, arithmetic and combinatorial features. In this section we shall focus on 
some arithmetical aspects of spectra of these graphs, including open problems and 
conjectures. Our main topics will be the Ramanujan property (see Definition 2.34) 
discussed in Section 3.5, and the concept of the energy of a graph. 

4.1 Basics on integral circulant graphs By Theorem 2.33 we know that inte¬ 
gral circulant graphs ICG(«, T>) = (Z„, E) are Cayley graphs characterized by their 
order n and a set T> of positive divisors of n and 

E := {(a, b) : a, b e Z„, (a — b, n) e X>}. 

The unitary Cayley graphs (cf. Definition 2.13 (ii)) are the integral circulant graphs 
with V = {1}. 

Let us recall some general facts about ICG(/?, T>) for arbitrary positive integers n 
and arbitrary divisor sets T> c D(n) {d > 0 : d \ n}. 


160 


Jurgen Sander 


Cayley graphs 



integral 
Cayley graphs 


circulant graphs 

integral circulant graphs 
= gcd graphs 


f unitary Cayley graphs = 

V gcd graphs with D={1} ) 1 






Figure 4.15. Subclasses of graphs among Cayley graphs 


Proposition 4.1. 

(i) Since ICG (n,T>) is circulant, its eigenvalues A/ f (n , T>) (1 < k < n), say, can 
be calculated by using Proposition 2.31, yielding 

X k {n,V) = ^ c(k, ^) (1 <£<«), (4.26) 

deV 

where 

c(k,n) := exp (~~) 

j mod n 

o»=i 

/.v the well-known Ramanujan sum ( cf. [4] or [44] ). 

(ii) ICG(/i, T>) is regular, more precisely Of/?. T>)-regular, where 

<b(n,V) = X n {n,V) = (4-27) 

deV 

with Euler’s totient function tp. This follows from Exercise 2.15 (iii), saying 
that a Cayley graph Cay(G, .S') is \S\-regular. 
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(iii) By the observation of So [46] that ICG(/?. T)) with T> = {d\ . d r }, say, 

is connected if and only if gcd(c/|,..., d r ) — 1, connectivity can readily be 
checked. If ICG(/?. T>) is connected, then 0(/;, T>) is the largest eigenvalue 
of ICG(/z, T>), the so-called spectral radius of the graph, and it occurs with 
multiplicity 1 (cf. Proposition 2.26 (ii)). 

Exercise 4.2. 

(i) Show that ICG(«, V) has loops if and only if n e V. For that reason, we shall 
usually require that T> c D*(n) := {0 < d < n : d \ n}. 

(ii) Deduce (4.26) from Proposition 2.31. 

(iii) Prove Proposition 4.1 (ii). 

Recently, Le and the author [28] observed that (4.26) can be rewritten as 

A k(tuT>) — (1 *£> c(k,-))(n) (1 < k < n ), (4.28) 

where 1 is the constant function and *d denotes the so-called V-convolution of arith¬ 
metic functions introduced by Narkiewicz [36], which is a generalisation of the clas¬ 
sical Dirichlet convolution. This representation of the eigenvalues will be of great 
importance for our purpose. 

In general, given non-empty sets A(n) C Din) for all positive integers n, the 
( arithmetical ) convolution A or the A-convolution of two arithmetic functions f, g e 
C N is defined as 

C f*Ag)(n)= 

deA(n) 

All convolutions considered in the literature are required to be regular, a property 
which basically guarantees that / *a g is multiplicative for multiplicative functions 
/ and g, and the inverse of 1 with respect to *a, i.e., an analogue of the Mobius 
function /i, exists. Narkiewicz [36] proved that regularity of an A-convolution is, 
besides some minor technical requirements, essentially equivalent with the following 
two conditions: 

(i) A is multiplicative, i.e., A(mn) = A(in)A(n) := {ab : a e A(m). b e A ( n )} 
for all coprime m , n e N. 

(ii) A is semi-regular, i.e., for every prime power p s with .s > 1 there exists a divisor 
t — tA(p s ) of s, called the type of p s , such that A(p s ) = {l, p‘, p 2t p jt } 
with j = j. 

Both of these properties will occur in a natural fashion along our way, but for our 
purposes concerning the eigenvalues of ICG(n, T>) the multiplicativity of the divisor 
sets V will be the guiding feature. 

The product of non-empty sets A\,...,A t of integers is defined as 

t 

n A i ■= {«t 

i — 1 


a t : at e A, (1 < i < t)}. 
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For infinitely many such sets A 1 , A 2 ,..we require that A ,■ = {1} for all but finitely 
many i and define 

oo oo 

ff. = n- 4 ' 

i=l i=l 

Ai*{ 1 } 

Let us call a set A of positive integers a multiplicative set if it is the product of non¬ 
empty finite sets Af C {1 , pt, pf, pf,...}, 1 < i < t say, with pairwise distinct 
primes p\,...,p t . In other words, a given set A is multiplicative if and only if 
A — ]~1 pevAp, where A p := { p e e^ : a e A} for each prime p, and e p (a) 
denotes the order of the prime p in a. Observe that A p ^ {1} only for those finitely 
many primes dividing at least one of the a e A. 

If D is a multiplicative divisor set, then this property is extended to the spectrum 
of the corresponding integral circulant graph. 

Proposition 4.3 (Le and Sander [28], 2012). For a multiplicative divisor set T> = 

Wp\n v p Q D (”)> 

A k (n,V)= f] A k (p e » {n \V p ), (4.29) 

pe P, p\n 

where for each prime p dividing n the integers A^( p ep ^ n \ T> p ), 1 < k < p e p( n \ are 
the eigenvalues oflCG(p ep ^ n \'D p ), and A k{p Sp ^ n \ T^p) is defined for all integers k 
by periodic continuation mod p e rA ,l \ 

4.2 Ramanujan integral circulant graphs In 2010 Droll [13] classified all 
Ramanujan unitary Cayley graphs ICG(//, {1}). We extend Droll’s result by drawing 
up a complete list of all graphs lCG(p s ,T>) having the Ramanujan property for each 
prime power p s and arbitrary divisor set 'D. In order to do this we shall have to check 
for which 'D c D*(p s ) the Ramanujan condition 

A(pfV) := A(ICG(/,D)) < fpFV)- 1 (4.30) 

is satisfied (cf. Definition 2.34 and Proposition 4.1 (ii)). Since Ramanujan graphs 
are required to be connected, we necessarily have 1 e V for a Ramanujan graph 
ICG(, 'D) due to the criterion of So (cf. Proposition 4.1 (iii)). While (4.27) provides 
an explicit formula for <$>(p s ,V), it is just as important to have such a formula for 
A (p s ,V). Asa tool to prove such a formula we first show 

Lemma 4.4. Let p s be a prime power and V — {p a> , p ° 2 ,..., p Ur ~ l , p ar } with 
integers 0 < a\ < 02 < • • • < a r -\ < a r < s. Then 

r r 

A k (p s ,T))= J2 V(p s ~ ai )~ Y, P^ 1 (4 ’ 31) 

i = 1 i — 1 

a t >s-j a t —S j 1 

for k e {1,2,..., p s }, where j := e p (k). 
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Proof. By (4.26) in Proposition 4.1, we have for k e {1,2 p s ) that 

h (pW) = X c ( k ’ ~f) = E c ^p s ~ ai )- 

deV i =1 


(4.32) 


We use two well-known properties of Ramanujan sums (cf. [4] or [44]), namely 
c(k, n) = c(gcd(A,«), /?) for all k and and 

{ <p(p v ), if u >v, 

-p v ~ l , if u = v — 1, 

0, if u < v — 2, 


for primes p and non-negative integers u and v. On setting m := -jy, i.e. k = p J m 
with 0 < j < s and m > 1, p j m, it follows that 


c(k,p s - ai ) = c(p J m, p s ~ a ') 


= c{p m ' a Up s ~ a ') = 


< p(p s ~ a ‘ ), if,/' > .v -a t , 

-pS-di-l, if j = s- ai - l, 

0, if / < .y — at — 2. 


Inserting this into (4.32), we obtain (4.31). 


□ 


Exercise 4.5. Check the identities for Ramanujan sums stated in the preceding proof. 

Proposition 4.6. Let p s > 3 be a prime power, and let V c D(p s ) with 1 e V. 
Then A(2 S , {1}) = 0 for s >2, and in all other cases 

A( P w) = p s - 1 - X>(y)- 

deV 

d+\ 

where the empty sum for T> = {1} vanishes. 

Proof. Since 1 e D, we have V = {p ai , p a2 ,..., p ar }, say, for suitable integers 
0 = a\ < a 2 < ■ ■ • < a r -\ < a r < s. Writing k = p J m, p \ m. with suitable 
integers j , 0 <j <s, and m > 1 for each k e {1,2,..., p s }, we obtain, by Lemma 
4.4 (= 4.2 in [29]), that 


A k {p s ,V)= X <P(P S ai )~ X P S ^ '• (4.33) 

i — 1 i — 1 

a i >s~j ai=s-j -1 

We distinguish several cases. If j — s , that is k = p s , we have Xk(p s ,T>) = 
<&(p s ,T>), which is the largest eigenvalue of ICG( p s , V) (see (iii)) and thus irrelevant 
for the determination of A{p s , V). Hence we are left with the following three cases, 
the last two of which correspond with the cases in the proof of Proposition 4.1 in [29] : 
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Case 0: 0 < j < s — a r — 2. 

Observe that this only occurs if a r < s — 2. Then (4.33) implies 


X k (p s ,V) = 0. 


Case 1: s — ag < j < s — an-\ — 2 for some 2 < l < r. 

Then 

r 

x k (p\V) = Y <p(p s ~ ai ) = *(p s ,v(P at ))> 0 

i = l 

at >at 

with D(x) {d e V : d > x}. 

Case 2: j = s — ag — 1 for some 1 < l < r. 

Then, on setting rz r +i := 5 + 1 and consequently <&{p s ,D(p ar+1 )) = 0, 
we have 


X k {p s ,V) 


Y <p(p s ~ ai ) - p s ~ ai ~ l 

i = 1 

a-i >«£ +1 

0(/,X>(^+ 1 )) - < 0, 


such that \X k (p s , V)\ — p s ai 1 — <i>(p s , V{p ai + X )). 


Let us start by looking at the special case r = 1, i.e., V — {1}, a\ = 0 and s > 
1. This means that Case 1 never occurs. Case 2 does occur for j =5 — 1, with 
corresponding X k (p s , V) — p s ~ l . This value equals 0(2^, {1}) = cp(2 s ) = 2 s ~ l in 
case p — 2 (and s > 2, since .v = I is outruled by our condition p s > 3), hence 
A (2 s ’, {1}) = 0 (originating from Case 0). For any p > 3 and all s > 1, we have 
p s ~ l < <&(p s , {1}), which proves the proposition in this situation. 

From now on, we assume that r > 2. We define 


m = mx> ■ = 


oo, if V is uni-regular, 
min l, otherwise. 

2 <l<r 
ai-at- 1>2 


(4.34) 


For vix> — oo, Case 1 does not occur at all. Obviously, 

<d (p s ,v) > ®(p\v(p a n) > Hp s ^{p ai+l )) > o 


for 1 < l < r — 1. If mx> < oo, Case 1 occurs for l — vix>, but for no smaller l. 
Therefore, the only candidate for A(p s , T>) originating from Case I is <fi( p s . 'D(p a ™ )). 
On the other hand, it is easily seen that 

0(/,22) > p s - a t- 1 - <D(/,D(^+i)) > p s ~ a w~ l - <&(p s ,V(p ai + 2 )) > 0 
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for 1 < l < r — 1. Hence, the only candidate for A (p s ,V) originating from Case 2 
is p s ~ ai ~ l — $(p s ,V(p a2 )) = p s_1 — <i>(p s ,V(p a2 )). Now let us compare the two 
candidates for A (p s ,T>) found above. By the definition of m — m -p, we obtain 

r r 

<$>(p s ,V(p a2 )) + <D(/ ,V{p a ™)) = £>(/"«') + J2 (p(p s ~ ai ) 

i =2 i —m 

m— 1 r 

i =2 i—m 

r 

= p s ~ a2 - p s ~ am - 1- 1 + 2 J2 v(p s ~ ai ) 

i—m 

< p s ~ a2 - — p s ~ a m- 1 -1 _)_ 2 p s ~ a m < 

This implies /r 5-1 — <$>{p s ,V(p a2 )) > Q(p s , V(p am )), and we have 

A(/,2>) = /- 1 - <*(/>*, W 2 )) = /- 1 - ^ ^(y)- □ 

deV 
dj= 1 


Corollary 4.7. Let p s > 3 be a prime power, and let T> C D(p s ) with 1 e D. 
(i) '/'/ten 


<&(/>*,2>) + A(p s ,V) 


2 s J , if p = 2, s >2 andV = {{}, 
/r 5 , otherwise. 


(ii) For all odd primes p, we have A(p s ,T>) = 0 if and only ifT> = D(p s ). 


Proof. The identity (4.35) follows right away from (4.27) and Proposition 4.6. The 
second assertion is another consequence of Proposition 4.6, because 



deV 

d^l 


with equality if and only if V = D(p s ). 


□ 


Corollary 4.8. Let p s >3 be a prime power, and letT) C D*{p s ) with 1 e D. Then 
ICG( /r\ T>) is Ramanujan if and only if either p = 2, s > 2 and T> = {1} or, in all 
other cases, 

vifj) - pSl ~ 2 '/P I+2 ’ (4 - 36) 

deV 

d+\ 

where the empty sum for T> = {1} vanishes. 
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Proof. The special case p = 2, s > 2 and 2? = {1} is an immediate consequence of 
Proposition 4.6. From (4.30) and Proposition 4.6 it follows in all other situations that 
ICG (p s ,T>) is Ramanujan if and only if O 2 := Yldev d^i Tiyf) satisfies 

p s ~ l - $2 < 2- s /0(p J , V) - 1 = 2 y/(p{p s ) + 4> 2 — 1. 

A short calculation reveals that this is equivalent with (4.36). □ 

Expanding our definition of D(n), we set D(n\ m) {d e D{n) : d > m} 
for any positive integer m. Now we are able to show precisely which ICGs of prime 
power order do have the Ramanujan property. 


Theorem 4.9 ([30]). Let p be a prime and s a positive integer such that p s > 3. Let 
'D C D*(p s ) be an arbitrary divisor set of p s . Then ICG( p s . T)) is a Ramanujan 
graph if and only ifT> lies in one of the following classes: 

(i) V = D(p^^~ l ) U V for some V C D(p s ~ l \ p^ 2 I); 

(ii) T> = {1 } in case p = 2 and s > 3; 

(iii) T> = D(p^) U V such that \T>\ > 2 for some V c D(p s ~ l \ ) in case 
p e {2, 3} and s > 3 odd; 

(iv) V = 0(2^) U V for some V c D{2 s ~ l \li) satisfying 0 V ^ {2 5-1 } 
in case p = 2 and s > 4 even; 

(v) = {1,2 2 , 2 3 ,2 4 } in case p — 2 tmr/.v = 5; 

(vi) = £)(5 t~) U {5^~~}UD' for some V c £)(5 ,s ~ 1 ; 5^“) in case p — 5 
and s > 5 odd; 

(vii) T> — D( 2T 5 ) U {2T 1 } U V for some V c £)(2 ,s_1 ; 2^“) satisfying 


3-2^2 + 


5—3 

2 ^ 


... -y-i x—\ 

<2 2 £ 

d'eV' 


1 

d' 


(4.37) 


in case p = 2 and s > 5 odd. 


Theorem 4.9 tells us for each prime power in a simple way how to choose divisor 
sets to obtain an integral circulant graph that is Ramanujan, except for the final case 
(vii), where the construction is a little more intricate. In this situation, the binary 

s — 3 

expansion of the left-hand side of (4.37) is obtained by adding 2 2 ~~ to the binary 

expansion of the real constant 3 — 2 a/2 = (0.0010101111101100001.. ,) 2 - Obvi¬ 
ously, the sum on the right-hand side of (4.37) is a binary expansion by construction, 
and in order to generate a Ramanujan graph we just have to pick V appropriately. 
Let us sketch an example that illustrates what to do explicitly in case (vii): 

Consider the prime power 2 29 , for which condition (4.37) turns into 

1 r 1 
3 — 2 V 2 + — = (0.0010101111110100001.. .) 2 < — 14, 

i = 15 
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with V = { 2 a 15 , 2 " 16 ,..., 2 ar }, say. The parameter r > 15 may be selected arbi¬ 
trarily so that a r < 28. Now for each choice of the a,-, 15 < i < r, it is completely 
obvious whether the corresponding divisor set V generates a Ramanujan graph or 
not. For instance, the choice a\ 5 = 17, ai 6 = 19, <217 = 21, a is = 22, ci \9 = 23, 
a 2 0 = 24, « 2 i = 25, a 22 = 26, a 22 = 27 generates the Ramanujan graph 

ICG(2 29 , D(2 12 ) U {2 14 ,2 17 ,2 19 ,2 21 ,2 22 ,2 23 ,2 24 ,2 25 ,2 26 ,2 27 }), 
while the divisor set of 

ICG(2 29 , D(2 12 ) U {2 14 ,2 17 ,2 19 ,2 21 ,2 22 ,2 23 ,2 24 ,2 25 ,2 26 ,2 28 }) 

violates (4.37), and thus the graph is not Ramanujan. 

Theorem 4.9 (i) immediately implies the following statement. 

Corollary 4.10. For each prime power p s > 3 and all divisor sets V — 
{1, p r ~ 1 } with | < r < s, the graph lCG(p s ,T>) is Ramanujan. In par¬ 

ticular, there is an integral circulant Ramanujan graph ICG( p s , T>) for each prime 
power p s > 3. 

We like to make the reader aware of the fact that the divisor sets ensuring the 
Ramanujan property in Corollary 4.10 are uni-regular, as introduced in [29], where 
a subset of D{p s ) is called uni-regular if it consists of successive powers of p. 

Proof of Theorem 4.9. Let us assume that V = {p ai , p a2 ,..., p ar } for suitable in¬ 
tegers 0 < < a\ < a 2 < • • • < a r -i < a r < s — 1. Since Ramanujan graphs have 

to be connected by definition, i.e., 1 e V, we necessarily have a 1 = 0. 

We consider the case r = 1, i.e., T> — {1} separately. We know by Corollary 4.8 
that ICG(2' , {1}) is Ramanujan for each .v > 2. For p > 3, the Ramanujan property 
requires p s ~ 1 — 2 p s ^ 2 + 2 < 0 by condition (4.36) of Corollary 4.8. It is easy to 
check that this inequality holds if and only if .y = 1 or s = 2. For r — 1, we thus 
have exactly the following types of Ramanujan ICGs: 



P s 

V 

A 

2 s (s > 2) 

{1} 

B 

P (P> 3) 

{1} 

C 

P 2 {P > 3) 

{1} 


In the sequel we may assume that r >2. Since a r < s — 1, we trivially have 
r 

^v(p s ~ ai ) < {p s - a2 -p s - a2 - l )+(p s - a2 - l -p s - a2 ~ 2 )+---+(p-\) = p s -° 2 - 1 . 
i= 2 

Hence, by Corollary 4.8, a necessary condition for lCG(p s ,T>) to be Ramanujan is 

that _ 

p s ~° 2 > p s ~ x -2^y+2. (4.38) 
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Let us first distinguish cases according to the value of a 2 . If 02 > 3, thus s > 4, 
a simple calculation shows that (4.38) is never satisfied. Assume next that a 2 = 2, 
thus s > 3. It is easily seen that (4.38) requires s < 5, and we obtain only the pairs 
(s, p) e {(3,2), (3, 3), (4,2), (5,2)} satisfying (4.38). Checking our Ramanujan con¬ 
dition (4.36) explicitly for these pairs, we precisely obtain Ramanujan ICGs as listed 
below: 



P s 

V 

D 

p 3 , 2 < p < 3 

{1 -P 2 } 

E 

2 4 

{1,2 2 } or {1,2 2 ,2 3 } 

F 

2 5 

{1,2 2 ,2 3 ,2 4 } 


We are left with the case a 2 — 1. Using the notation introduced in (4.34), it 
follows that nip > 3, and we shall distinguish between nip = oo and nip < oo. In 
the first case, i.e. for uni-regular divisor sets T>, we have s >2 and 

r r 

E ^~ fl ') = = <p(p s ~ 1 )-<p(p s - t )- 

i—2 i=2 

Then Corollary 4.8 tells us that ICG {p s ,T>) is Ramanujan if and only if 

p s ~ r +2 (4.39) 

For s < 2 r, this condition is satisfied for all primes p. hence we obtain a Ramanujan 
graph 1CG( p s , V) for each prime power p s with s > 2 and uni-regular divisor sets 
V = {1 p r ~ 1 }, where | < r < s. For s > 2r + 2, condition (4.39) does not 

hold for any prime p. For the missing case s — 2r + 1, (4.39) yields the Ramanujan 
condition ^fp + < 2, which is only satisfied for p = 2 or p — 3 and all odd 

s > 5. Therefore, we have two more types of Ramanujan ICGs: 


P s 

V 

G ^(peP,s>2) 

{\, p -- p r ~ 1 } (min{2,§}</-< 5) 

H p s {p e {2, 3}, 5 > 5, 2 { s) 



Now let us consider the case m = m p < oo. We have 
r m— 1 r 

Y.v( P s - a ‘) = E ( p(P s ~ ai )+ E ^p s ~ ai ^ 


i= 2 


i= 2 


= p s ~' - p+ e <p(p s ~ ai )- 


i—m 
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Then, by Corollary 4.8, ICG (p s ,V) is Ramanujan if and only if 

r 

p s a m —\ i + 2 < 2y /y + <p(p s ~ a ‘)- (4.40) 

i—m 

For y < 2(a m -i + 1), i.e., | > s—a m -\ — 1, the Ramanujan condition (4.40) is satis¬ 
fied for all primes p. By the definition of m = nip, we have a m -\ = m — 2. Hence 

lCG(p s ,V) is Ramanujan for any prime power p s and V = {1, p . p m ~ 2 , p am , 

..., p a ' }, if 3 < m < r satisfies 5 < 2(m — 1) and a m > m, which requires s > 4. 
Setting l m — 1, we have found the next type of Ramanujan ICGs: 


p s V 


I /(^eP, s > 4) {l,p,...,p e - 1 ,p a t+p...,p ar } 

(§ < l < «£+l - 1) 


For 5 > 2(a m _i + 3), i.e. | < s — a m -\ — 3, we obtain for all primes p 


P+-7Z. 


s a m —1 1 


> 2 > 


a ii _ a 

)2 + l 2 p? 


nS a m —i 2 


+1 > 


a m —i 2 


+1, 


hence 


yS a-ni— 1 1 _j_ ^ — p s a m— 1 2 


P + 


> P 


, s a »i - I 2 


F 

2 pi 


+ 1 = 2pi + p 


s O-m—X 2 


> 2pi + p s ~ am > 2y/y + J 2 v(p s ~ a ‘ )• 


This contradicts (4.40), which means that there are no Ramanujan graphs in this situ¬ 
ation. 

It remains to consider the three special cases s = 2(c/ m _ i + 1) + j with j e 
{1,2, 3}. This turns (4.40) into the Ramanujan condition 


j 

P 7 


-2 < 


!>(/-*' )- 2 


(4.41) 


By the definition of m — nip, we know that a m > a m -\ + 2. Since 


r 

£ ^p s ~ ai ) ± p s ~ am - 1 ^ p Ua m—l + l)+7 — (^m —1 +2) _ ^ — p a >n—l +7 _ J 
i—m 











170 


Jurgen Sander 


careful calculations reveal that condition (4.41) can only be satisfied for the pairs 
(j, p) e {(1,2), (1,3), (1,5), (2,2), (3,2)}. For the first two pairs, corresponding 
with a m -1 — ^2, the left-hand side of (4.41) is negative, while the right-hand 
side is non-negative except for the special case p — 2 and a m — a r — s — 
1. However, a closer look shows that in this latter situation (4.41) is still satis¬ 
fied, and we obtain Ramanujan graphs ICG(/r v . T>) for p e {2,3}, where T> — 

{1, p . P^,P ^ i , • • •, P ar } with a.s+\ = a m > a m -i + 2 = ^=2 + 2 = s -^~, 

thus s > 5. Inserting the third pair j = I and p = 5 into (4.41), a short computation 
discloses that the corresponding ICGs are Ramanujan if and only if a m = a m -1 + 2, 

g ^ iS - 1 - 1 

and then T> — {{1,5,..., 5t“, ,..., 5 ar } with 5 > 5. To sum up, the Ramanu¬ 

jan ICGs for (j, p) e {(1,2), (1,3), (1,5)} are the following: 


p s V 

J p s (p e {2,3}, s > 5, 2 { s) {1 ,p . 

K 5 s (s > 5, 2 \ s) {1,5.5^,5^,5°^,...,5 a ''} 


For j — p = 2, the Ramanujan condition (4.41) reads Y^i=m P s a ‘) — 2, and 
this always holds unless a m — a r =5 — 1, i.e., we obtain Ramanujan ICGs of type 


p s V 


L 2 s (s>6, 2\s) {l,2,...,2^,2 a 3,. 

CN 

1 

VI 

05 kN 

Q 

VI 

05 led 

V. 

<3 

(N 


We are left with the case j = 3 and p = 2, where s — 2a m -\ + 5 = 2 (m — 2) + 
5 = 2m + 1, and our Ramanujan condition (4.41) becomes 


2s/2-2< 


1 


r ) a m — \ + 2 \ 

^ \l —m 




2 TO V2 \_ 


±r-°‘-'-A = U±^_ 


\/2\ _ 2a ‘~ m 2 m ~ l J 


Apparently, this is equivalent to 4 — 2\/2 < 2a .L m — 2 J_ 1 , where we know 

that cn > i for all i > m . This immediately implies that the Ramanujan condition 
necessarily requires that a m — m. Then our condition reads 


3 — 2\f2 < J2 


i=m +1 


2 a r 


1 _ A 1 l 

~~ . 2 ai ~^ 2"?’ 


(4.42) 
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and we have found our last type of Ramanujan ICGs: 


V 


M 2 s (s>5, 2\s) {1,2.2T 1 ,2V L ,2 a ' S ^ i . 2 a 

{as+i ,. ■ ■ ,a r satisfy (4.42)) 


To complete the proof of Theorem 4.9, it remains to show that the Ramanujan ICGs 
of types A-M exactly constitute the Ramanujan graphs listed in the assertion. The 
following table provides the interrelations between types A-M on one hand and cases 

(i)-(vii) on the other hand. 


(i) A (s = 2), B(s= 1), C (s = 2), G (s> 2), I (s > 4) 

(ii) A (s > 3) 

(iii) D (.v = 3), H (s > 5 odd), J (s > 5 odd) 

(iv) E (.S' = 4), L (5 > 6 even) 

(v) F (.s' = 5) 

(vi) K (s > 5 odd) 

(vii) M (s > 5 odd) □ 


Applying the concept of multiplicative divisor sets introduced at the end of Sec¬ 
tion 4.1, Corollary 4.10 can be extended to integral circulant graphs whose order is 
an arbitrary composite number. It has recently been shown that for every even integer 
n > 4 and a positive proportion of the odd integers n , namely those having a “domi¬ 
nating” prime power factor, there exists a divisor set T> c D*(n) such that ICG(«, D) 
is Ramanujan. Unfortunately, the corresponding assertion cannot be confirmed for all 
odd 77 by use of the tool of multiplicative divisor sets. In fact, it will transpire that 
the set of odd n for which no Ramanujan graph ICG(/7, T>) with multiplicative divisor 
set V exists has positive density. This means that one would definitely need to study 
integral circulant graphs with non- multiplicative divisor sets in order to finally settle 
the matter. 

At the end of this section we just state and discuss some further results and finally 
ask a couple of questions. 

Theorem 4.11 ([39] (2013)). For each even integer n > 3 there is a ( multiplicative ) 
divisor set T> c D*{n) such that ICG(7i, T>) is a Ramanujan graph. 

If n is an odd integer, a multiplicative divisor set T> c D*{n ) such that ICG(«, T>) 
is Ramanujan can only exist if n has a comparatively large prime power factor. This 
will be specified in detail by the following theorem. To this end, we define for every 
integer n > 1 its largest prime power factor PP(/7) := max pe p ;/7 |„ p e p( n \ and we 

shall say that n has a dominating prime power factor if PP(/r) > jt? . 
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Theorem 4.12 ([39] (2013)). Let n > 3 be an odd integer. 

(i) If n has a dominating prime power factor, then there is a ( multiplicative) divi¬ 
sor set T> C D*(n) such that ICG(«, T>) is a Ramanujan graph, 

(ii) If n > 8259 does not have a dominating prime power factor, then there is no 
multiplicative divisor set T> C D*(n) such that ICG (n,T>) is a Ramanujan 
graph. 

There are a little less than 200 odd integers n < 8259 without dominating prime 
power factor, the smallest being 315 = 3 2 • 5 • 7 and the largest 8211 = 3 • 7 • 17 • 23. 
For each of these n we assured ourselves that ICG(n.D) is not Ramanujan for the 
“canonical” multiplicative candidate V \f[ pe? p n Tip to produce a Ramanujan 

integral circulant graph, namely by choosing V q = D*(q e, T"' ) ) for the largest prime 
power q e< iW — PP(/?) of n and T> p — D(p ep ^) for all other primes p \ n. Check¬ 
ing, however, all possible multiplicative divisor sets for all those n would require an 
enormous computational effort, which is not justified by whatever the result is. 

While Theorem 4.11 confirms the existence of a Ramanujan graph ICG(/t, V) for 
all even n, Theorem 4.12 leaves unanswered the question if for every odd n there is 
a Ramanujan integral circulant graph of order n. However, Theorem 4.12 (ii) does tell 
us that in order to deal with this problem one would have to study integral circulant 
graphs with non-multiplicative divisor sets T> for infinitely many n. Our final result 
discloses the proportion between the odd n for which ICG (n.V) is Ramanujan for 
some multiplicative divisor set T> C D*(n) and those odd n without this property. 
More precisely, it turns out that both cases occur with positive density among all odd 
integers. 

For this purpose, let k(x) denote 

#{n < x : 2 \ n, ICG (n.V) Ramanujan for some multiplicative T> C D*(n)\. 

It can be shown that lim^oo exists and is positive. Notice that the set of odd 
prime powers below x, for each of which there is some Ramanujan ICG by Corollary 
1.1 in [30] (see also Proposition 4.10), does not suffice to warrant this result, since it 
has density zero in the set of positive integers. By use of methods and results from 
analytic number theory, one can show 


Theorem 4.13 ([39] (2013)). For x -» oo, 



Corollary 4.14. We have 


lim —— = log 2, 

X^-OO 4 


i.e ., the density of odd positive integers n for which ICG (n,T>) is Ramanujan for 
some multiplicative divisor set T> C D*(n) in the set of all odd positive integers 
equals log 2, while the corresponding density of odd positive integers n such that no 
ICG(«, T)) with multiplicative T> is Ramanujan equals 1 — log 2. 
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Open problems: 

• Given a (sufficiently large) positive integer n, is there some 'D C D*(n) such 
that ICG(n, V) is Ramanujan? 

• How do non-multiplicative divisor sets such that the corresponding integral 
circulant graphs are Ramanujan look? 

4.3 The energy of a graph In 1978 Gutman [15] introduced the mathematical 
concept of the energy 

E(G) := Y, W' 

AeSpec(C/) 

of a graph Q, though this concept is rooted in chemistry way back in the 1930s (see 
[32] for connections between Hiickel molecular orbital theory and graph spectral 
analysis, and [9] for a mathematical survey). 

We denote the energy of an integral circulant graph by 

n 

S(n,V):= E(lCG(n,V))= Y |A| = £ \k k (n, V)\. (4.43) 

AeSpec(ICG(n,T>)) k = 1 

It is of particular interest to determine for any fixed positive integer n the extremal 
energies 

fmin(n) := min{£(/7,X>) : V c D*(ri)} 


and 


^max(») := ma x{E(n,V) : V c D*(n)}, 

as well as the divisor sets producing these energies. For prime powers n — p s this 
problem was completely settled by the author and T. Sander in [40], Theorem 3.1, 
and [41], Theorem 1.1, respectively. The basis of these results was an explicit for¬ 
mula evaluating £(p s ,V) (cf. [40], Theorem 2.1). Due to the lack of a comparable 
energy fonnula for arbitrary n, it seems much more difficult to deal with £ mm (n ) and 
£max(« ) in general. Unfortunately, the energy reveals no sign of multiplicativity with 
respect to n. We shall overcome that deficiency by using the concept of multiplica¬ 
tive divisor sets (cf. Section 4.1 and [28]), which is closely linked with the eigenvalue 
representation in (4.28). This will at least provide us with good bounds for £ mm (n) 
and £ mm (n) as well as divisor sets producing the corresponding energies. In case 
of the minimal energy, we even conjecture to have found £ m i n (») and its associated 
divisor sets for all n. 

By use of (4.28) and Proposition 4.3, Le and the author proved in [28], Theo¬ 
rem 4.2, that given a multiplicative divisor set V C D(n), we have for the energy of 
ICG(n, V), as defined in (4.43), that 

S(n,V)= n £(P ep(fl) >Vp)> 

pe P, p\n 


(4.44) 
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where the sets T> p are defined as in Proposition 4.3. We shall investigate minimal and 
maximal energies of integral circulant graphs with respect to multiplicative divisor 
sets. To this end, we define 

Smmin) := min {S(n,T>) : T> c D*(n) multiplicative} 


and 

S max .i n ) : = max{£(«, T>) : T> C D*(n) multiplicative}. 

Clearly, 

in in) < 

^min in) — ^max in) — ^max in) (4.45) 

for all positive integers n. Moreover, for prime powers p s any divisor set V c D(p s ) 
is trivially multiplicative, hence £ mm (p s ) = £ min ip s ) and £ max (p s ) = S max (p s ). 

We shall hrst state all our results and provide the corresponding proofs later. 


Theorem 4.15 ([29] (2012)). Let n >2 be an integer with prime factorisation n — 
p\ l . p s f. Then 

t 

£max(«) = n 6 ip?)’ 

i = 1 

where for any prime power p s 

ff+vp (( s + 1)( ^ 2 ~ X) P S + 2 iP S+1 ~ ’ '/ 2 + s > 

(sip 2 - 1 )p s + 2(2p s+1 - p + p 2 - p - D), if 2 | 

Moreover, for multiplicative sets T> c D*(n), we have Sin, T>) = £ max in) ifandonly 

ifv = n' = l £> (0 with 


V {i) = 


(! 1 ■ Pf-Pv 

{ 1 , 2 , 2 4 , 

lu .pf.pt 


Si— 3 Si — 1 . 

■’Pi ’Pi }. 

,2 s ’- 3 , 2 Si ~ 1 } or {\,2, 2 3 . 

Si—4 Si—2 Si — 1 


■ Pi 


.., 2 Si _4 ,2 s '- 2 ,2 s '- 1 }, 

7-3 ! 


}or{l,^,u 3 , 


if 2. \ si , pi > 3, 

;/2 j j,-, pi = 2, 


It should be observed that the maximising factor divisor sets are (almost) 
semi-regular (cf. (ii) at the end of Section 4.1). 

In order to describe the multiplicative divisor sets minimising the energy, let us 
call T> C D{p s ) for a prime power p s uni-regular, if T> = {p u , p u+l ,..., z? 1 ' -1 , p v } 
for some integers 0 < u < v < s, which is a special case of semi-regularity if u = 0 
and v = s. 
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Theorem 4.16 ([29] (2012)). Let n > 2 be an integer with prime factorisation n = 
/tj 1 . pf and P\ < P 2 < ■ ■ ■ < Pt- Then 




1 - 

Pi 


Moreover, for multiplicative sets T> C D*(n), we have £(n,T>) = £ m in (n) if and only 

ifV — ]~[| = i w ith = {p™} for some u e {0, 1 . sq — 1 } and arbitrary 

uni-regular sets T>^ with p\ l e T>^ fori <i<t. 

Remarks 4.17. 

(i) According to Proposition 4.1 (iii), ICG(/?, V) is connected if and only if the 
elements of V are co prime. Assuming connectivity in Theorem 4.16, min¬ 
imising sets V — nU 27^ then necessarily have T>^ — {1} and X>h) = 
{1, pi, pf,..., p s f } for 2 < i < t, i.e., all X>h) w ith 2 < i < t have to be 
semi-regular sets of type 1. 

(ii) The proof of Theorem 4.16 reveals that for T> C D(n), i.e., possibly n e V, 
we would have T mm (/0 = n with minimising sets 'D = \\ l = } 'D u \ where each 

is an arbitrary uni-regular set containing p s f . 

(iii) One could prove Theorem 4.16 by using the energy formula for lCG(p s ,T>) 
containing loops (cf. [28], Proposition 5.1). Instead we shall take a closer 
look at the second largest |A| with A e Spec(ICG (n,T>)). This provides more 
insight into the underlying structure. 

As a consequence of (4.45), we immediately obtain from Theorems 4.15 and 4.16 
the desired bounds for the extremal energies of integral circulant graphs with arbitrary 
divisor sets. 

Corollary 4.18. Let n >2 be an integer with prime factorisation n = /fj 1 . pf 

and pi < p2 <••• < Pt- Then 


(i) £max(») > £max(») = f[i = l 0 OiP)>' 

(ii) £ mm (n) < Trainin') = 2n(l - 


Examples show that, while equality between T lmix (n) and T rniix (/i) does occur oc¬ 
casionally, we usually have £ max (/?) > T lmx (n) (cf. Example 4.20). This phenomenon 
is due to the fact that maximising divisor sets normally are not multiplicative. Yet 
£ ma x(« ) falls short of T max (n) by less than a comparatively small factor. To state this 
result, we denote by tp Euler’s totient function, by tin) the number of positive divi¬ 
sors of n, and by co(n) the number of distinct prime factors of n. As before, e p (n ) 
denotes the order of the prime p inn. 
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Theorem 4.19 ([29] (2012)). Let n be a positive integer. Then 

f. , w Vi4)x{d) 

(i) ^maxvO ^ M ^ 

d\n 

= « n (K 1 ~ p)( e p( n ) + l )( e p( n ) + 2 ) + j) ; 

/7€P, /?|ft 

(ii) ^max(«) < (|)" ( ' !) « r(«) 2 ; 

(ill) ^rnax («) ^ ^max («) r(n). 

The proof of Theorem 4.19 (ii) will show that 

(} f n) x(n) 2 < £ < (f)^ r(n) 2 , (4.46) 

d\n 

and (ii) is deduced from (i) by use of the upper bound in (4.46). We like to point 
out that the constants ^ and | in the lower and upper bound of (4.46), respectively, 

cannot be improved for all n, although we expect Yld n % (t )“("> T (n) 2 for 

most n. Yet, each of the bounds is sharp for integers with certain arithmetic properties 
(cf. Remark 1). 

As far as the magnitude of £ max (//) is concerned, it is well known that r(n) — 
0(n s ) for any real e > 0. In fact the true maximal order of r(n) is approximately 

log 2 

n i>>g|<>E«. On average, r (n) is of order log //, but for almost all n it is considerably 
smaller, because the normal order of r(«) is roughly (log/7) log2 ss (logtj) 0,693 . For 
all these results as well as for the average and normal order of co(n) the reader is 
referred to [17], §§ 18.1-18.2, § 22.13, and § 22.11. 

As opposed to sets maximising the energy of integral circulant graphs, numerical 
calculations suggest that divisor sets minimising the energy are always multiplicative. 
According to that, equality in Corollary 4.18 (ii) should hold for all n. We shall 
specify this observation in two conjectures at the end of this section. 

Now we start to prove all results stated so far in this section. 

Proof of Theorem 4.15. Let n — p\' . p s t ' be fixed, and let V c D*(n) be a 

multiplicative set satisfying £(n. T>) = T max (/;). By (4.44) (cf. [28], Theorem 4.2), 

t 

£max(/0 = £(n,V) = n £(p e ? in \V p ) = Y\£(p s f,V Pi ). 

peF, p\n i = 1 

This implies that £(p Si , T> Pj ) = £ max (pp) for 1 < i < t. From [41], Theorem 1.1, 
we conclude that £ ma x ( p]‘) = 0 ( pf ) for all i, and the only corresponding divisor 
sets T>^ are just the ones listed in our assertion. □ 
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While equality between £ max (n) and £ max (n) does occur occasionally, we usually 
have £ max (n ) > £ max («). This is illustrated by 

Example 4.20. We have 9(p) — 2( p — 1) for each prime p. By use of Theorem 4.15, 
we easily obtain £ max (6) = 9(2)-9(3) = 8,£ max (105) = 9(3)-9(5)-9(7) = 384, and 
£ max (21) = 9(3 ) • 9(7) — 48. Numerical evaluation of (4.43) yields £ max (6) = 10, 
£ max (105) = 520, but £ max (21) = 48 = E max (2l). 


In order to be able to compare £ max (n) and £ max («) and finally prove Theorem 
4.19 completely, we start by establishing an upper bound for T max (/t). 

Proof of Theorem 4.19 (i) & (ii). (i) By (4.43), (4.28), and the well-known Holder 
identity for Ramanujan sums (cf. [4], chapt. 8.3-8.4), we have for any n and any 
divisor set V C D(n) 

" ^ ( n \ <p(%) 


S(n,V) = 

k— 1 


deV 




where // denotes the Mobius function. Let n/T> \= {7 \ d £ V} c D(n) be the set 
of complementary divisors of all d eT> with respect to n. Then 


£(n,V) = J2 


k= 1 
n 


E Ksr«) 


<P(d) 


den I'D 




iE E J^= E E >■ 


k=lden/D V({d,k)) 


den ID 


,14 <) 


fe=l 

(k,d)=g 


Since 


E * = * E > = y E > = EO 


k=l 

(k,d)=g 


r= 1 

(r,d)=g 


r= 1 

's’ 


{rA)= 1 


we conclude that 


c , ,^ <P( d ) T (d) ^ <P( d ) r (d) 

£(n,V)<n 2^ ---- 1 - 


den IV 


d\n 


Since this holds for all n, the inequality in (i) follows. Since (p, r and the identity 
function id are all multiplicative, this property is carried over to ^ and its summatory 
function 

, Y s ^(p(d)r(d) 
f(n) := ^- 


</ 


(n e N). 


(4.47) 
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Given a prime power p s , we obviously have 




O 7 - p ] l )(j + i) 


7=0 


7 = 1 


— ~ (l )( J + 1)( J + 2) H . 

2 \ p/ n 


(4.48) 


This completes the proof of (i). 

(ii) Taking into account (i), it suffices to prove the upper bound in (4.46), but in 
order to justify the remark following Theorem 4.19 we shall verify the lower bound 
as well. By (4.48), we have for all primes p and all positive integers s that 


1 

- < 
4 - 



< 


_fipA_ < 3 
(s + l) 2 “ 4 




(4.49) 


thus \t(p s ) 2 < f(p s ) < |r (p s ) 2 . By the multiplicativity of r(«) and the additivity 
of co(n), which implies the multiplicativity of c" for any positive constant c, this 
proves (4.46). □ 


Remark 1. It is easy to see that ^+^2 comes close to the lower bound | in (4.49) for 

p = 2 and large s and close to the upper bound | for large p and s = 1. Hence, the 
lower bound in (4.46) is approached for integers n that are high powers of 2, while 
the upper bound is approached for squarefree integers n having large prime factors. 


Proposition 4.21. Let p be a prime, and let s be a positive integer. Then 


g(p s ) 


P s f(p s ) ( ^r0 + 2) for 2 \ s, 

g ma AP S ) ~ | P ~W • (s+1) t f2> f or 2 I s 


for the function f defined in (4.47). More precisely, we have in particula r g( 2) = 2, 
g(p 2 ) < 3 andg( 2 4 ) = f|. 


Proof By [41], Theorem 1.1, we know that £ lm rAp s ) = ^max(/^') = (/7 s ) as de¬ 

fined in Theorem 4.15. 

Case 1: 2 \ s. 

By use of (4.48) we obtain 

(i(! - +!)(* +2)+ iW+!) 2 

s( v s ) < - --- 

(5 + l )(/> 2 - 1 ) + 2 />- 

(i ~ p)(r + l )(.s + 2)(p + i) 2 -)-1 

2{s+\)(p 2 -\) 2 p ‘ ? + 2) - 

Observe that the second inequality is an identity for p — 2 and .v = 1. 
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Case 2: 


2 | 

Applying again (4.48), we conclude 


g(p s ) < 


< 


+!)(* +2)+ })(/> +l) 2 

s(p 2 - 1) + 4 p-^ + js 

(1 “ p)( s + + 2 )(P + l) 2 p + 1 

2s(p 2 — 1) 2 p 


(5 + 1)(5 + 2) 
5 


The special values for g(p s ) are the results of straightforward computations. □ 


Corollary 4.22. Suppose that n is a positive integer with prime factorisation n — 
/tj 1 . p’i‘. Then we have 


*H n) < r (n) n P ‘ + ' 
£max(») f = i 2 Pi 


n(>+ 

1 = 1 

2 A/ 


1 


•Sj + 1 


)n( 



2 h/ 


Proof. By Theorem 4.19 (i), Theorem 4.15, and Proposition 4.21, 


£max(«) < nf(n) = A Si 

£nax(«) “ SmM } = \ 1 

^n^nfa+«n fe+l f ,+2) - 

i = 1 11 i= 1 i = l 1 

2 f v / 21 .v / 


(4.50) 


Since r (n) — I~[- =1 (5,- + 1), the corollary follows. □ 

Proof of Theorem 4.19 (iii). We cannot use Corollary 4.22 directly, but by (4.50) we 
know that 

^al< rr j ( ^<">). 

Since r(«) = n p sp,/)|«( e p( n ) + 1)’ it suffices to show that g(p s ) < s + 1 for any 
prime power p s . This will be verified by use of the different bounds obtained in 
Proposition 4.21. 

Case 1: 2 \ s. 

We have g(p s ) < + 2) < s + 1 for all p and s except for p = 2, 

5 — 1, but g(2) = 2 closes the gap. 

Case 2: s — 2. 

This case is settled by the fact that g(p 2 ) < 3. 
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Case 3: 2 | s and s > 4. 

Here we have g{p s ) < • 0+!K^+2) < $■ _|_ ] f or all p and s except for 

p = 2, s = 4, but we know that g(2 4 ) = || < 5. □ 

Besides multiplicativity, our proofs concerning £ mm (n) will be based on knowl¬ 
edge about the second largest modulus of the eigenvalues of ICG(n,X>) (cf. Re¬ 
mark 4.17 (iii)), i.e., about 

A(n,T>) := max{|A| : A e Spec(ICG(«,X>)), |A| < 0(/7,X>)}, (4.51) 

which we only define if ICG(«, V) has eigenvalues differing in modulus. It will be 
crucial for us to gather some facts about A (n , V). The first step is 


Lemma 4.23. Let n be a positive integer and T> c D(n) with n e T). 

(i) Then £(n, D) > n. 

(ii) ICG(/j, V) has a negative eigenvalue if and only if £(n. D) > n. 


Proof From linear algebra we know that YHk= i ^ki n • 2?) equals the trace of the ad¬ 
jacency matrix of ICG(«, V). Since n e V by hypothesis, ICG(n,X>) has a loop 
at every vertex, i.e., all diagonal entries of its adjacency matrix are 1. Hence, 
12k=i Afc(«, 2?) = n, and consequently 


£(n,V) = J2 \hin,V)\ > \J2h(n,V) 

k—\ k—l 


= n, 


(4.52) 


which proves (i). Equality in (4.52) only holds if all A^(n. D) have the same sign. 
We know that A„ ( n ,T>) — $(/;, T>) > 0, and this implies (ii). □ 


Proposition 4.24. Let p s be a prime power and V c D(p s ) with p s e 'D, and set 
r:=\V\. 

(i) For r — 1, i.e., T> = {p s }, we have Spec(ICG(n,X>)) = {1}. 

(ii) Let r > 2. Then V is uni-regular if and only if A(p s ,T>) — 0. In this case the 
maximal eigenvalue <$>(p s ,T>) = p s ~ a 1 = p r ~ x has multiplicity p a] . 

(iii) If r > 2 and T> is not uni-regular, then £(p s ,T>) > p s . 

Proof. Let V — {p ai , p ai ,... , p ar ~ l , p Ur } with 0 < < a 2 < ••• < a r -i < 

a r = s. Hence in case r — 1, that is V — {p s }, we trivially have A k(p s ,V) = 
c(k, 1) = 1 for all k, which proves (i). 

Henceforth we assume r > 2. On setting j := e p (k). we apply Lemma 4.4 and 
distinguish two cases. 
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Case 1: s — at < j < s — an-\ — 2 for some 1 < l < r, where a 0 := —2. 

Using the notation V(x) := {d e 'D : d > x}, we obtain from Lemma 4.4 

that 

r 

A k (p*,V)= £ <p(p s - ai ) = ®(p s ,- D (p a *)), (4.53) 

i — 1 
A* 

with d>(n , D) as defined in (4.27). 

Case 2: j = s — ai — 1 for some 1 < £ < r — 1. 

Now Lemma 4.4 yields 

r 

A k ( p s ,V)= J2 (P(p s ~ ai )-P s ~ ae ^ = $(p , ,% at+1 ))-/" 8rl . 

r = l 

(4.54) 

In order to prove (ii), we first assume that ? /r s-r+2 ,..., p s } 

is uni-regular. We observe that for ai = ai-\ + 1 (2 < l < s) the corresponding 
interval considered in Case 1 is empty. Hence Case 1 occurs only if l — 1, i.e., for 
s — a\ < j < s, and then 

A k(p\V) = $(/,%“')) = = \ p s(p s ,V), (4.55) 

which is the largest eigenvalue. This reflects the phenomenon that the largest eigen¬ 
value has multiplicity greater than 1 if a i >0, that is, the elements of V are not co 
prime or, equivalently, ICG (n,V) is disconnected (see Remark 4.17 (i)). More pre¬ 
cisely, we have for each j = s — u, 0 < u < a\, exactly <p(p u ) integers k = phn 
with p \ m and 1 < k < p s . Hence the multiplicity of the largest eigenvalue 
Q(p s , T>) is precisely Ylu=o ( P(P U ) — P ax ■ By (4.55) we know that <i>(p s , T>) = 
<&(p s , V(p ai )), and since a\ = y — r+linD = {p s ~ r+l , p s ~ r+2 ,..., p s ~ l , p s }, 
the asserted formulas for <&(p s , T>) in (ii) follow. 

By the argument above, eigenvalues other than the largest one can only appear in 

Case 2. For V — {p s ~ r+l , p s ~ r+2 . p s ~ l , p s } and j = r — l — 1 (1 < l < r), 

we obtain by (4.54) 

r—i— 1 

A k {p\V) = Hp s W~ r+l+l ))-p r ~ i - l = Y, <P(P l ) - P'^ 1 = 0- 

i =o 


This proves A (p s ,T>) — 0. 

To complete the proof of (ii), it remains to show that A (p s , D) 0 for any non- 
regularsetD. Such a divisor set can be written as T> = {p ai ,..., p at , p at+x ,..., p ar } 
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with 0 < ci\ < a 2 <•••< a r — s, ag+\ — ag >2, and cii +1 — a,- = 1 for some 
1 < f < r — 1 and all i = t + 1, — 1. Then for all A: = p s ~ ae ~ 1 m, p \ m, i.e. 

j = s — ag— 1, have by (4.54) in Case 2 

r 

X k {p s ,V) = &(p s ,U(p a£+1 )) — p s ~ a z~ l = £ <p(p s ~ ai ) - <0- 

i=l +1 

(4.56) 

hence A (p s ,T>) ^ 0. 

It remains to verify (iii). Since T> is not uni-regular, we have negative eigenvalues 
by (4.56), and Lemma 4.23 (ii) proves £(p s , V) > p s . □ 

It was shown in [40], Theorem 3.1, that for a prime power p s 

£min(/) =2/(l-i). (4.57) 

Observe that the minimum is extended over divisor sets T> C D*(p s ), i.e., over 
loopless graphs. Moreover, the p s -minimal divisor sets were identified precisely as 
the singleton sets V = {/A j with 0 < j < s- 1. For our purpose we shall require 
a corresponding result for graphs lCG{p s ,T>) containing loops, that is with p s e T>. 

Proposition 4.25. Let p s be a prime power. Then 

£mm{p s ) '■= min {£{p s ,T>) : p s e V c D{p s )} = p s , (4.58) 

where the minimising divisor sets are exactly the uni-regular ones. 


The reader might notice that (4.57) implies £ mm (p' s ) < £ m m(p s ), where equality 
only holds in case p = 2. 


Proof of Proposition 4.25. For r — 1, the assertion follows immediately from 
Proposition 4.24(i). Flence assume that r > 2. By Proposition 4.24(iii), it suf¬ 
fices to show that we have £(p s ,V) — p s for each uni-regular divisor set D — 
{p a 1 , p ai + l ,..., p s ~ l , p s }. We know from Proposition 4.24(h) that ICG(/t, V) has 
only two different eigenvalues, namely <t>( p\ 'D) = p s ~ a 1 with multiplicity p ai and 
0 (consequently with multiplicity p s — p ai ). Therefore, £(p s . T>) — p ai ■ p s ~ ai = 
p s , as required. □ 


Proof of Theorem 4.16. Let V C D*(n) be a multiplicative set such that £{n,T>) = 
fminW- Then T> = ]~[; = i f° r certain divisor sets V^ C D(p s f ) (1 < i < t), 
and ^ m in(n) = \\ i = l £{p s f ,T>^) by [28], Corollary 4.1(h). By the minimality of 
£(n,T>) it follows from [40], Theorem 3.1, and our Proposition 4.25 that for 1 < 
i < t 


£(p -‘, X> (0 ) 


I 


if p.‘ £ 
if p\ l e D (i) , 
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where 'D (l 1 is either a singleton set {p “ l } for some 0 < w, < Sj — 1, or a uni-regular 
set containing p s f , respectively. This yields 

t 

£min (n) = n Yl 2 

i — 1 

and our assumption n T> implies that p\ l V (l * for at least one i. Under this 
restriction, and since p\ is the smallest of the primes involved, it is easily seen that 
the right-hand side of (4.59) becomes minimal if p\' £ V 1 - 1 * and p s f e T> {1 ] for 
2 < i < t with the corresponding divisor sets T>^ just mentioned. □ 

We confined our study of the energies of integral circulant graphs to the rather 
restricted class having multiplicative divisor sets. Yet, somewhat unexpectedly, this 
led to good bounds for £ m in(«) and £ mdX (n). On top of that, the results obtained 
by the study of multiplicative divisor sets, combined with some numerical evidence, 
encourage us to make the following two conjectures. 

Conjecture 4.26. For each integer n > 2, we have £ mm (n) = 2n (I — where p\ 
denotes the smallest prime factor ofn. 

Conjecture 4.27. Let n > 2 be an arbitrary integer. Then £(n, 'D) = £ mm (n ) implies 
that T> is a multiplicative divisor set. 

Observe that Conjecture 4.26 is a consequence of Conjecture 4.27 by Theorem 
4.16. 

In 2005 it was conjectured by So that, given a positive integer n, two graphs 
ICG(«,X>i) and ICG(/i,X> 2 ) are cospectral, that is Spec(ICG(n,X>i)) = 
Spcc(ICG(//. 'D 2 )), if and only if T>\ = V 2 . For n = p s this follows immediately 
from (4.27) by straightforward comparison of the largest eigenvalues <£>(p s ,Vi) and 
<t>( , Vf) of the two graphs. So’s conjecture was also confirmed for the slightly more 

general case n — p s q r with primes p ^ q and t e {0, 1} (cf. [10] for details), but 
its proof required the study of eigenvalues other than the largest one as well as their 
multiplicities. For arbitrary positive integers n and arbitrary divisor sets the problem 
is still open. Therefore, we suggest to study the following weaker conjecture, which 
might be more accessible. 

Conjecture 4.28. Let n be a positive integer, and let T> \,V 2 C D* (n) be two multi¬ 
plicative sets. If ICG(«, T> i) and ICG(n, T> 2 ) are cospectral, then 'D\ — T > 2 . 
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